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CONVERSION FACTORS, U. S. CUSTOMARY TO METRIC (SI) 
UNITS OF MEASUREMENT 


U. S. customary units of measurement used in this report can be con¬ 
verted to metric (SI) units as follows: 


_ Multiply _ 

acres 

acre-feet 

cubic feet per second 
Fahrenheit degrees 
feet 
inches 

miles (U. S. statute) 
pounds (mass) per cubic foot 
square miles 
tons (2000 lb, mass) 


_ B y 

4046.873 

1233.482 

0.02831685 

5/9 

0.3048 
2.54 

1.609344 

16.01846 

2.589988 

907.1847 


_ To Obtain _ 

square metres 

cubic metres 

cubic metres per second 

Celsius degrees or Kelvins* 

metres 

centimetres 

kilometres 

kilograms per cubic metre 
square kilometres 
kilograms 


* To obtain Celsius (C) temperature readings from Fahrenheit (F) read¬ 
ings, use the following formula: C = (5/9) (F - 32). To obtain Kelvin 
(K) readings, use: K = (5/9) (F - 32) + 273.15. 
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EMPIRICAL METHODS FOR PREDICTING 


EUTROPHICATION IN IMPOUNDMENTS 
PHASE I: DATA BASE DEVELOPMENT 


PART I: INTRODUCTION 


1. This report documents the development of a data base describing 
certain water quality aspects of reservoirs operated by the U. S. Army 
Corps of Engineers (CE). The data base includes information on project 
location, morphometry, water quality, hydrology, and sedimentation. As 
part of the Environmental and Water Quality Operational Sutdies (EWQOS) 

Program being conducted by the Office, Chief of Engineers, U. S. Army, 
this work has been conducted to provide groundwork for assessing empiri¬ 
cal approaches to describing and predicting reservoir trophic status. 

2. One basic strategy employed in assembling this data base has 
been to utilize existing, centralized sources of information first. 

These have included nationwide data bases maintained by various federal 
agencies, as well as a few sources of data tabulated at a regional level. 

A framework has been designed and implemented for storing and accessing 
this information, with flexibility for updating and general access, so as 
to meet the specific objectives of this project. Having utilized cen¬ 
tralized data sources to their fullest extent, data gaps have been identi¬ 
fied and used to set priorities for locating and incorporating information 
from relatively diffuse sources, such as specific project design memoranda 
and other published or unpublished reports dealing with individual projects. 

This stage-wise data-gathering procedure has been designed with efficiency 
and cost-effectiveness in mind. 

3. Another basic strategy which has been employed in compiling 
water quality and hydrologic data has been to assemble individual obser¬ 
vations in space and time (i.e., "raw data"), rather than average values. 

This strategy accomplishes the following: 

i 







a. It provides for the broadest possible range of future 
uses of the data base. 

b. It eliminates possible variations due to the use of dif¬ 
ferent averaging procedures. 

£. It provides a basis for error analysis and assessment of 
data adequacy in future model testing. 

These advantages must be weighed against the major disadvantage of the 

approach—it involves management of a large amount of information. The 

water quality file presently contains two million observations taken at 

4451 stations located in or around 271* CE projects. 

4. Management of these types and quantities of information entails 
use of a consistent framework. One basic strategy has been to tag each 
bit of information with district, project, and data source codes. While 
the validity of the information at its original source cannot be sub¬ 
stantiated, use of a systematic approach in building the data base insures 
that the data are transferred and accessed properly. Keeping track of 
original data sources provides a means of checking any piece of informa¬ 
tion at its source and identifying discrepancies among multiple data 
sources for the same value. The latter provides one indication of data 
and source reliability. Another validity test involves checking 

for internal consistency in a given set of values. For example, the 
morphometric profiles have been tested by comparing reported volumes at 
any elevation with the integral of reported areas with respect to depth. 
The third validity test involves distribution of portions of the data 
base to district offices for verification and editing. This entails 
their cooperation and assumes that district-level sources of information 
for specific projects are the most accurate. This approach has been 
taken for upgrading the morphometric data file with reasonable success. 

5. In its current state, the data base is a collection of infor¬ 
mation in a well-defined framework. It is not a user-oriented system 
designed for frequent interactive use. Such a system would require 


* A total of 299 projects are included in the data base; no water quality 
data have been located for 28. 
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extensive software development geared to a specific computer system and 
to the intended uses of the information in various areas of reservoir 
management. The scope of this project has been limited to compiling 
the information, organizing it, and extracting portions that are directly 
relevant to analysis of eutrophication problems in reservoirs. With 
additional software development and systems programming, the data base 
could be made accessible for more generalized purposes. 

6. The complete data base consists of a collection of computer 
files, reports, data forms, and maps. As discussed above, each piece 
of information is referenced by CE district, project, and data source 
codes. Part II of this report describes the facilities, methods, and 
agency contacts used in this work. Part III describes the general 
structure of the data base. Parts IV through XI document the sources 
and approaches used in compiling each element and present data inven¬ 
tories. Parts XII-XIV summarize results of specific analyses which lay 
the groundwork for use of the data base in Phase II of this project. 

These analyses cover the following topics: (XII) numerical characteri¬ 
zation of reservoir hypsographic curves; (XIII) assessment of the 
variability of trophic state indicators in reservoirs; and (XIV) testing 
of methods for estimation of nutrient budgets. Conclusions and Recom¬ 
mendations are given in Part XV and XVI, respectively. Appendix A 
contains data inventories by project and district. 









PART II: FACILITIES AND METHODS 


7. Compilation and manipulation of the various data files docu¬ 
mented in this report have been done on an IBM 370-168 computer maintained 
by the Information Processing Center of the Massachusetts Institute of 
Technology (MIT). This facility has been used in a batch processing mode 

(OS/VS1) and in an interactive mode through IBM's Conversational Monitor¬ 
ing System (CMS). Three media have been used for data storage where 
appropriate: (1) 9-track tapes (6250 bytes per inch); (2) 3350 disc 

packs (OS and CMS); and (3) cards. Copies of the current versions of 
all files have been transferred to tapes for secure storage and future 
access. 

8. While most of the information used to assemble the data base has 
been read from tapes supplied by various agencies, some files (in par¬ 
ticular, the project lists, morphometry, and sedimentation files) have 
been assembled from tabulated data. In these cases, cards have been 

used for data entry. Keypunching has been done and verified using con¬ 
tract services offered by MIT. 

9. Programming for data manipulation and analysis is in the PL/I 

and FORTRAN IV languages. The Biomedical Computer Program package 
1 2 

(BMDP) and SAS have also been used in preliminary data analyses. Plots 
have been produced with a Calcomp line plotter. 

10. Access to the Environmental Protection Agency (EPA) STORET 
system 3 has been acquired through the cooperation of the Water Quality 
Laboratory of the New England Division of the Corps of Engineers. The 
staff of the Systems Analysis Branch of the EPA Region I Office in Boston 
has been helpful in submitting STORET retrievals. The identification of 
water quality and quantity monitoring stations has been done partially 
using the services of the National Water Data Exchange of the U. S. 
Geological Survey in Reston, Virginia. The Corvallis Environmental 
Research Laboratory of the U. S. Environmental Protection Agency has 
provided reports and data files from the National Eutrophication Survey 

4 

(NES) . Sedimentation survey sheets have been obtained through the 
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Sedimentation Laboratory of the U. S. Department of Agriculture and the 
South Technical Service Center of the U. S. Soil Conservation Service 
in Fort Worth, Texas. 

11. Staff members of the Environmental Laboratory of the U. S. 

Army Engineer Waterways Experiment Station (WES) have provided assistance 
in extracting and coding morphometric and drainage area data from project 
design memoranda and in coding water quality data complied outside of 
STORET. The Ohio River Division (ORD) of the Corps of Engineers provided 
tapes containing water quality data gathered by district monitoring 
programs in that division. 









PART III: DATA BASE STRUCTURE 


12. Figure 1 depicts the organization of the data base into 

eight major file groups: 

CODES - Data Base Codes 

LISTS - Project Lists 

WATS - Watershed Characteristics 

RESER - Reservoir Characteristics 

HYDRO - Hydrology Data 

WQ - Water Quality Data 

SED - Sedimentation Data 

NES - EPA National Eutrophication Survey Data 

Each group contains a number of computer files, data forms, and/or maps. 
File names are in two parts. The first refers to the major file group 
and the second, to the specific file within that group. For example, 
the water quality station key is given the name WQ.KEY. A lowercase 
second name indicates that the element is a map or data form (not a 
computer file). 

13. U. S. customary units have been used most extensively in the 
files. This has facilitated the transfer and verification of information, 
since most of the original sources of morphometric, drainage are*, hydro- 
logic, and sedimentation data were also in U. S. customary units. Or.e 
exception to this convention is the EPA National Eutrophication Survey 

4 

Compendium file, which was supplied by the EPA in metric units. 

14. The development, structure, and contents of each file group 
are discussed in the following sections. The sources and approaches 
used in compiling the information are described. Each file is character¬ 
ized with respect to format and content. Since most files are too large 
for listing, data holdings are summarized in an inventory format, with 
categories defined by file, variable, and CE division. Data inventories 
by project and district are included in Appendix A. Record formats for 
the files described in this report are defined as PL/I data structures. 













PART IV: CODES - DATA BASE CODES 

15. CODES consists of a group of files which define referencing 
systems used in various portions of the data base. These include the 
following: 

CODES.DIS CE District Codes 

CODES.DIV CE Division Codes 

CODES.SOURCE Data Source Codes 

CODES.TYPE Station Type Codes 

CODES.POOL Pool Codes 

CODES.OUTLET Outlet Codes 

CODES.PARAM Parameter Codes 

Listings of these files are given in Tables 1 through 5. 

16. District and division codes (Table 1) provide a numerical 
indexing system for each of 36 districts and 10 divisions, respectively. 
Districts are grouped within divisions. The New England Division is 
unique in that it is not comprised of districts. In order to permit 
referencing of all projects at the district level, district number one 
has been defined to represent the New England Division. 

17. Data source codes (Table 2) provide a referencing system for 
nine data sources which are used frequently in the data base. Identi¬ 
fying each data entry by source provides a basis for validation and 
sorting out discrepancies among multiple data sources for a given project 
and characteristic. 

18. A total of nine station type codes have been defined for use 
in the water quality and hydrology files (Table 3). These provide a 
frame of reference for locating monitoring stations within a given 
project. Broadly, these permit distinction among stations located on 
upstream tributaries, within reservoir pools, and in or below reservoir 
discharge streams. Within-pool stations are further classified as 
upper-pool, mid-pool, or near-dam. Mid-pool is used as a default for 
lake stations. The remaining two are used in cases where coordinates, 
maps, and/or station location descriptions provide an adequate basis 
for more refined classification. Secondary tributary codes (upstream 
and downstream from impoundments) have been used only for some EPA 
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National Eutrophication Survey stations to aid in hydrologic budget 
computations. 

19. Pool and outlet codes (Table 4) are used in the morphometric 
file. These provide systems for referencing various elevations to pool 
allocations for specific uses, ranges of operating levels, and locations 
and types of principal outlets. The systems were initially designed 

at WES. Additional codes have been added as needed during subsequent 
morphometric data compilation. 

20. The parameter codes file (Table 5)* is used to reference 
hydrologic and water quality data. The file contains 94 members, each 
identified by a water quality parameter code, STORET code"^, measurement 
type, and units. The 5-digit STORET code is used in retrieving water 
quality and hydrologic data from the STORET system. It is also used 

to identify measurements in the hydrology files. In addition to the 
89 basic water quality parameter codes included in the file, there are 
11 redundant parameter codes, which have been used in retrieving water 
quality data from STORET. Redundant codes result from multiple means 
of expressing a given type of observation (e.g., temperature in degrees 
C or degrees P or total phosphorus as P or as P0 4 ). Redundancies have 
been eliminated in final data storage by applying appropriate conversion 
factors in each case. 
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Table 1 


District and Division Codes 


Code 

District 

Div 

Code 

Division 

01 

Mew England 

01 

01 

New England 

02 

New York 

02 

02 

North Atlantic 

03 

Philadelphia 

02 

03 

South Atlantic 

04 

Baltimore 

02 

04 

Ohio River 

05 

Norfolk 

02 

05 

North Central 

06 

Wilmington 

03 

06 

Lower Mississippi Valley 

07 

Charleston 

03 

07 

South West 

08 

Savannah 

03 

08 

Missouri River 

09 

Jacksonville 

03 

09 

North Pacific 

10 

Mobile 

03 

10 

South Pacific 

11 

Buffalo 

05 



12 

Detroit 

05 



13 

Chicago 

05 



14 

Rock Island 

05 



15 

St. Paul 

05 



16 

Pittsburgh 

04 



17 

Huntington 

04 



18 

Louisville 

04 



19 

Nashville 

04 



20 

St. Louis 

06 



21 

Memphis 

06 



22 

Vicksburg 

06 



23 

New Orleans 

06 



24 

Little Rock 

07 



25 

Tulsa 

07 



26 

Fort Worth 

07 



27 

Galveston 

07 



28 

Albuquerque 

07 



29 

Kansas City 

08 



30 

Omaha 

08 



31 

Walla Walla 

09 



32 

Seattle 

09 



33 

Portland 

09 



34 

Sacramento 

10 



35 

San Francisco 

10 



36 

Los Angeles 

10 









Table 2 


Data Source Codes 


_ Data Source _ 

Leidy and Jenkins 

EPA National Eutrophication Survey 
District or Division 
Sedimentation Survey Sheets 
Design Memoranda 

USGS State Water Resources Data Reports 
USGS/WATSTORE File 
EPA STORET 

INFONET - Ohio River Division 


Table 3 

Station Type Codes 


Code 

Station Type 

01 

Tributary 

02 

Pool 

03 

Discharge 

04 

Pool (nr. dam) 

05 

Pool (headwaters) 

06 

Unused 

07 

Point source 

08 

Sec. trib. (downstr. 

09 

Sec. trib. (upstr.) 




Flood control 
Conservation 
Water quality 
Minimum 
Summer 

Winter 
Water supply 
Power 
Recreation 
Dead storage 

Multiple use 
Stream bed 
Top of dam 

Period of record minimum 
Period of record maximum 

Normal 

Maximum power 
Minimum power 
Sediment 

Maximum regulated 


Outlet 


WLfi 


>e 






Table 5 


f 


Parameter Codes 


STORET 

HYDRO 

w Q Component 

Units 

00027 

00028 

72025 


01 Code for agency collecting sample 

02 Code for agency analyzing sample 

03 Depth of pond or reservoir 

feet 

00068 


04 Maximum sample depth 

feet 

72020 

72020 

05 Elevation 

ft > msl 

00062 


06 Elevation, reservoir surface 

ft 

72030 


07 Elevation of reservoir pool 

ft > msl 

00054 

00054 

08 Reservoir storage 

acre-ft 

72033 


09 Flow, average daily, spillway 

cfs 

72034 


10 Flow, instantaneous, spillway 

cfs 

00061 


11 Stream flow, instantaneous 

cfs 

00060 

00060 

12 Stream flow, daily 

cfs 

00065 

00065 

13 Stream stage 

feet 

00010 

00010 

14 Temp 

deg-C 

00011 


*14 Temp 

deg-F 

00300 

00300 

15 02 Dissolved 

mg/1 

00299 


16 02 Dissolved, electrode 

mg/1 

00090 


17 Oxidation reduction potential 

mv 

00094 


18 Specific conductivity, field 

umhos/cm 

00095 

00095 

19 Specific conductivity, lab 

umhos/cm 

00400 

00400 

20 pH (field) 

su 

00403 


21 pH (lab) 

su 

00410 


22 Alkalinity, total as CaCo3 

mg/1 

00435 


23 Acidity, total as CaCo3 

mg/1 

00900 


24 Hardness, total as CaCo3 

mg/1 

00940 

00940 

25 Chloride 

mg/1 

00945 

00945 

26 Sulfate total 

mg/1 

01045 


27 Iron, total as Fe 

ug/1 

71885 


*27 Iron total as Fe 

mmg/1 

01046 


28 Iron, dissolved 

ug/1 

01055 


29 Manganese, total as Mn 

ug/1 

71883 


*29 Manganese total as Mn 

mmg/1 

01056 


30 Manganese, dissolved 

ug/1 


(Continued) 


* Redundant water quality parameter code. 
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Table 5 (Continued) 


STORET 

HYDRO 


WQ Component 

Units 

00916 


31 

Calcium, total 

mg/1 

00915 


32 

Calcium, dissolved 

mg/1 

00927 


33 

Magnesium, total 

mg/1 

00925 


34 

Magnesium, dissolved 

mg/1 

00929 


35 

Sodium, total 

mg/1 

00930 


36 

Sodium, dissolved 

mg/1 

00937 


37 

Potassium, total 

mg/1 

00935 


38 

Potassium, dissolved 

mg/1 

00070 

00070 

39 

Turbidity 

jtu 

00074 


40 

Turbidity, transmissometer, percent 





transmission 

percent 

00076 


41 

Turbidity, Hach turbidometer 

ftu 

00078 


42 

Transparency, Secchi 

m 

00077 


*42 

Transparency, Secchi 

in 

00031 


43 

Light, percent remaining at given depth 

percent 

00034 


44 

Depth at which 1 percent of surface 





light rem 

ft 

00080 


45 

Color, true 

pt-co units 

00081 


46 

Color, apparent 

pt-co units 

00955 


47 

Silica, dissolved 

mg/1 

00956 


48 

Silica, total 

mg/1 

00310 


49 

B0D5 

mg/1 

00405 


50 

Carbon dioxide 

mg/1 

00680 


51 

Carbon total organic 

mg/1 

00681 


52 

Carbon dissolved organic 

mg/1 

00685 


53 

Carbon total inorganic 

mg/1 

00691 


54 

Carbon, dissolved inorganic 

mg/1 

00665 


55 

Phosphorus, total as P 

mg/1 

00650 


*55 

Phosphate, total as P04 

mg/1 

71886 


*55 

Phosphorus total as P04 

mg/1 

00666 


56 

Phosphorus, dissolved as P 

mg/1 

00669 


57 

Phosphorus total hydrolyzable as P 

mg/1 

00678 


58 

Phosphorus, hydrol + ortho, total. 





autoanal 

mg/1 

00671 


59 

Phosphorus, dissolved ortho as P 

mg/1 

00660 


*59 

Phosphate, ortho as P04 

mg/1 

70507 


60 

Phosphorus, inorganic total ortho as P 

mg/1 


(Continued) 


* Redundant water quality parameter code. 
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Table 5 (Continued) 


STORET 

HYDRO 


WQ Component 

Units 

00600 


61 

Total N 

mg/1 

71887 


*61 

Nitrogen total as N03 

mg/1 

00605 


62 

Organic N 

mg/1 

00610 


63 

Ammonia N 

mg/1 

71845 


*63 

Ammonia, total as NH4 

mg/1 

00625 


64 

Total Kjeldahl N 

mg/1 

00630 


65 

N02 + N03-N 

mg/1 

00615 


66 

N02-N 

mg/1 

71855 


*66 

Nitrite total as N02 

mg/1 

00613 


67 

Nitrite nitrogen dissolved as N 

mg/1 

71856 


*67 

Nitrite dissolved as N02 

mg/1 

00620 


68 

N03-N 

mg/1 

71850 


*68 

Nitrate N as N03 

mg/1 

00618 


69 

Nitrate nitrogen dissolved as N 

mg/1 

71851 


*69 

Nitrate N dissolved as N03 

mg/1 

00500 


70 

Residue, total 

mg/1 

00505 


71 

Residue, total volatile 

mg/1 

00515 


72 

Residue, total filtrable dried at 





105 deg C 

mg/1 

00530 


73 

Residue total non-filtrable dried at 





105 deg C 

mg/1 

80154 

80154 

74 

Suspended sediment cone - evap at 





110 deg C 

mg/1 

70300 

70300 

75 

Residue total filtrable at 180 deg C 

mg/1 

32209 


76 

Chlorophyll-A fluorometric, corrected 

ug/1 

32217 


77 

Chlorophyll-A fluorometric, uncorrected 

ug/1 

32211 


78 

Chlorophyll-A trichromatic, corrected 

ug/1 

32210 


79 

Chlorophyll-A trichromatic, uncorrected 

ug/1 

32230 


80 

Chlorophy11-A 

mg/1 

60050 


81 

Algae, total 

cells/ml 

00570 


32 

Biomass, plankton 

ml/1 

85209 


83 

Algal growth potential 

mg/1 

60990 


84 

Zooplankton, other 

no/liter 

31616 


85 

Fecal coliform, memb filter, m-fc broth 





44.5 deg 

no/100 ml 

31673 


86 

Fecal streptococci, memb filter, kf 





agar, 35 deg 

no/100 ml 

31679 


87 

Fecal strep mf m-ent 

no/100 ml 


(Continued) 


* Redundant water quality parameter code. 
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Table 5 (Concluded) 


STORET 

HYDRO 

W Q Component 

Units 

50051 


88 Flow rate instantaneous 

mgd 

50053 


89 Conduit flow - monthly 

mgd 

70301 

70301 

Dissolved solids - sum of constituents 

mg/1 

80155 

80155 

Sediment discharge 

tons/day 

70291 

70291 

Dissolved sulfate discharge 

tons/day 

70290 

70290 

Dissolved chloride discharge 

tons/day 

70302 

70302 

Dissolved solids discharge 

tons/day 
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PART V: LISTS - PROJECT LISTS 


21. LISTS, the second major file group, defines the referencing 
system used for CE projects in the data base. It consists of the fol¬ 
lowing files: 

LISTS.CPL - Central Project List 
LISTS.DPL - Deleted Project List 

The development and contents of these files are discussed below. 

22. The data base is built around a central list of 299 reservoirs 
which have been identified from various sources and placed in the LISTS.CPL 
file. The regional distribution of these projects is shown in Figure 2. 

Breakdowns by CE district and division are given in Table 6. The follow¬ 
ing have been used as criteria for inclusion: 

a. projects currently operated by the Corps of Engineers. 

b. projects having seasonal or permanent pools. 

The second criterion has been applied to eliminate locks and small run- 
of-the-river impoundments with short hydraulic residence times and little 
opportunity for inducing water quality changes, at least with respect to 
eutrophication. 

23. The two primary sources of information used to develop an 

initial project list include a tabulation of CE projects with surface 

areas greater than 500 acres compiled by Leidy and Jenkins 5 and a map 

0 

of CE water resource projects . Based upon CE Water Resource Development 

7 

Reports and information supplied by various CE district offices, the 
initial list has been screened to eliminate projects which are incom¬ 
plete, not currently under CE control, and/or do not have appreciable 
pools. A separate list of impoundments which have been eliminated has 
been maintained for future reference (LISTS.DPL). Because it has not 
been feasible within the scope of this project to compile and incorporate 
data from detailed, project-specific reports, the current project list 
may contain some impoundments which do not conform to the above criteria. 

Similarly, some projects may have been missed. Inclusion and/or screening 

i i 






of additional projects would be possible with more time devoted to com¬ 
piling and examining detailed reports. 

24. The record format used in the LISTS.CPL and LISTS.DPL files 
is given in Table 7. Piles are listed in Tables 8 and 9, respectively. 
Each project has been assigned a unique, three-digit identification 
code to facilitate referencing in the data base. The location of each 
project is identified by CE division, district, state, county, latitude, 
longitude, and hydrologic unit. Hydrologic unit maps compiled by the 

Q 

U. S. Geological Survey (USGS) have been used to provide basic location 
data. Reservoirs lying on the boundaries of states, counties, and/or 
hydrologic units have been referenced based upon dam location. State 
and county codes refer to the standard federal coding system (FIPS) 
documented in the EPA's STORET 5 user's manual. The latitudes and longi¬ 
tudes of projects in which surface elevation monitoring stations have 
been located refer to those stations, which occur most frequently at dam 
sites. In other situations, coordinates have been approximated from 
hydrologic unit maps and refer roughly to dam locations. 

25. As shown in Table 1, the project list is cross-referenced to 
three independent data bases: 

9 

a. the EPA National Eutrophication Survey Working Papers . 

b. the U. S. Department of Agriculture (USDA) compilation of 
reservoir sedimentation data-*-®. 

c. the CE project file compiled by Leidy and Jenkins 5 . 

The cross-referencing system facilitates access to specific information 
on projects contained in these sources. 
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PART VI: WATS - WATERSHED CHARACTERISTICS 


26. WATS, the third major file group, contains information on 

project watersheds. It consists of the following three elements: 

WATS.maps - Watershed Maps 

WATS.POLYS - Watershed Polygon Coordinates 

WATS.DAREAS - Drainage Area Characteristics 

This information supplements the location descriptors contained in the 

LISTS files. Each element is described below. 

27. A set of watershed maps has been compiled from the USGS hydro- 

8 9 

logic unit maps , EPA National Eutrophication Survey Working Papers , 

and a report on CE projects in the New England Division 11 . The maps are 
labeled with descriptive data contained in the LISTS.CPL file and stored 
in a loose-leaf notebook. Hydrologic unit maps, used most extensively, 
are on a scale of 1:500,000. An example is given in Figure 3. In some 
cases, projects have been built after map publication and only the water¬ 
sheds are shown. 

28. WATS.POLYS contains the latitude/longitude coordinates of 
polygons which contain projects and their watersheds. These coordinates 
have been used to identify water quality monitoring stations in STORET 
(see Part IX). Each record contains up to five coordinate pairs and is 
referenced by district and project codes. Coordinates have been estimated 

from watershed maps contained in EPA National Eutrophication Survey 

9 

reports . The 108 projects which were sampled under that program are 
represented in WATS.POLYS file. 

29. WATS.DAREAS contains additional descriptive information on 
project watersheds in the format given in Table 10. Each record is 
referenced by district, project, and data source codes. Table 11 lists 
the elements of the file along with corresponding data sources. Some 
discrepancies among multiple data sources for the same project and 
characteristic remain in the file, particularly in drainage areas and 
mean flows. These could be resolved through verification of the file 
at the district level. An inventory of the file contents by division 
is given in Table 12. Inventories by project and district are given in 
Appendix A. 

i 




Figure 3 


Sample Watershed Map 
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• RES: 194 PCMME OE TERRE * 
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» MAJOR TRIBUTARY: POMME OE TERRE « 
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Sources of Data for the WATS.DAREAS File by Component 
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PART VII: RESER - RESERVOIR CHARACTERISTICS 

30. The fourth major file grouping, RESER, contains detailed 
information on reservoir characteristics. It consists of the following 
four elements: 

RESER.MORPHO - Project Morphometry 
RESER.desc - Verbal Project Descriptions 
RESER.broch - CE Recreational Brochures 
RESER.COM - Comments 

Each of these elements is described below. 

31. The fluctuating pool levels characteristic of many reservoirs 
necessitates the compilation of morphometric data which are referenced 
to pool elevations. Volume and surface area variations with elevation 
are required for estimation of volume-averaged water quality conditions, 
given measurem nts made at specific depths. Seasonal variations in 
reservoir volume and discharge induce variations in mean depth and 
hydraulic residence time which may, in turn, influence the response of 
trophic state indicators to nutrient loading. Thus, detailed morphometric 
information is an essential component of the data base for eutrophication 
modelling and for other general uses. 

32. The record format and contents of the RESER.MORPHO file are 
described in Table 13. Each record is referenced by district, project, 
elevation, and data source code. In addition to area, volume, length, 
width, and shoreline length data, the file contains a series of pool and 
outlet codes, as listed in Table 4. These codes provide supplementary 
descriptive information on pool allocations for various uses, ranges of 
operating levels, and locations and types of principal outlets. 

33. The RESER.MORPHO file was initially based upon data extracted 
from project design memoranda. The other principal data sources include: 

(1) a report by Leidy and Jenkins 5 ; (2) USGS water resources data re- 

13 12 

ports, by state and year ; (3) sedimentation survey sheets ; and 

(4) district and division offices. Information compiled from these 

sources has been coded and sorted by project and elevation. Initial 

screening was done to identify and correct, where possible, any 
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obvious errors, such as decreasing area or volume with increasing eleva¬ 
tion within a given project. There were sufficient inconsistencies among 
the various data sources for many projects to warrant independent veri¬ 
fication of the file. Accordingly, the file was distributed to the 
districts through WES and additions and corrections were made based upon 
district responses. Final editing was done to eliminate most of the 
redundancies in the file due to multiple data sources for the same project 
and elevation. A current data inventory by division is given in Table 14. 
Inventories by district and project are given in Appendix A. 

34. RESER.desc consists of a collection of verbal project descrip¬ 
tions copied from USGS water resources data reports published annually 

by state^. The descriptions are referenced by district and project 
codes and assembled in a loose-leaf notebook. These descriptions summa¬ 
rize hydrologic monitoring activities by the USGS, along with important 
project characteristics and purposes. The file currently contains 
entries for 260 out of 299 projects in the central project list. 

35. RESER.broch is a collection of brochures published by CE dis¬ 
trict and division offices as guides to recreation in specific projects. 
These usually contain detailed project maps which are useful for locating 
monitoring stations. Project purposes and characteristics are also sum¬ 
marized. Each folder is referenced by district and project number and 
stored in a hanging file. Currently, the file contains information from 
eight districts: Pittsburg, Huntington, Louisville, Nashville, Vicksburg, 
Tulsa, Forth Worth, and Sacramento. 

36. The RESER.COM file contains miscellaneous descriptive infor¬ 
mation on various projects. This file has been designed to hold data or 
comments which do not conform to other file formats. Each record is 80 
characters long and is referenced by district and project codes. A 
listing of the current version of this file is given Table 15. 
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Totals 299 296 295 296 288 293 285 246 212 146 227 



Table 15 


Listing of the RESER.COM File 


01157 Mad River dam now under control of State of Connecticut 
01165 Connected to Hopkinton (01167) at high water 
01167 Connected to Everett (01165) at high water 
07232 Project transferred to Wilmington District (06) 

26363 Sediment survey refers to Old Waco 
28408 No permanent pool 

30214 2 Separate lakes (W. twin/E. twin) below elevation 1342 

30333 Project never reached full pool 

33298 Re-regulating dam for Green Peter (33299) 

33305 Re-regulating dam for Detroit (33293) 

34048 Sediment survey refers to Old Don Pedro 
34051 Sediment survey refers to Old Exchequer 






PART VIII: HYDRO - HYDROLOGY FILES 


37. The fifth file group, HYDRO, contains detailed hydrologic 

data, organized in the following files: 

HYDRO.KEY - Station Key 

HYDRO.DAILY - Daily Values 

HYDRO.MONTHLY - Monthly Summaries 
HYDRO.YEARLY - Yearly Summaries 

HYDRO.SUM - Grand Summaries 

This information has been compiled to provide bases for nutrient budget 
calculations, estimating pool hydraulic residence times (as influenced 
by reservoir elevation and discharge), and depth-averaging of water 
quality observations (as influenced by reservoir morphometry and pool 
level). Because of the stringent water quality sampling requirements 
for estimation of nutrient budgets, streamflow data required for such 
calculations have been compiled only for those projects and years 
sampled by the EPA National Eutrophication Survey. Attempts have been 
made, however, to compile reservoir discharge and elevation/contents 
data from all projects for 1965 to date using three data sources: USGS/ 

14 ^5 

WATSTORE , the EPA National Eutrophication Survey , and sedimentation 
12 

survey sheets . The HYDRO.KEY file describes 1307 stations from all 
three data sources in the format depicted in Table 16. 

38. The first data source includes USGS stations monitoring 
reservoir elevation/contents or streamflow at or below reservoir dis¬ 
charge points. These stations have been identified using the Master 

Water Data Index of the USGS National Water Data Exchange 1 *’ and USGS 

13 

water resources data reports by state and year . Daily values have 
been retrieved through STORET^ for the period from 1965 to the most 
recent available as of February, 1980. Only those stations with daily 
values entered in WATSTORE are included. 

4 

39. The EPA National Eutrophication Survey , which sampled 108 CE 
projects, assembled a hydrologic data base coirpatible with its water 
quality sampling network for use in nutrient budget computations. It 
includes streamflow estimates for upstream and downstream tributaries. 
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For each station, flows are estimated on three time scales: daily (only 
for the days on which water quality samples were taken), monthly (only 
for the months in which water quality samples were taken), and normalized 
monthly (normal flow for each month). This information has been retrieved 
from a tape provided by the EPA Corvallis laboratory 15 . Monthly and 
normalized monthly flows have been stored in the HYDRO.MONTHLY file. 
Because they only refer to water quality sampling dates, daily flows 
have been stored along with the water quality data in the WQ.OBS file. 

40. The third source of hydrologic data, sedimentation survey 

12 

sheets , has provided annual estimates of reservoir total inflow, mini¬ 
mum elevation, and maximum elevation, typically for 10 water years in 
most of the 84 projects for which sedimentation survey sheets have been 
located. This information has been stored in the HYDRO.YEARLY file. 

41. The formats of the DAILY, MONTHLY, YEARLY, and SUM hydrology 
files are listed in Tables 17 through 20, respectively. HYDRO.DAILY, 
which contains data from USGS/WATSTORE stations only, is in the WATSTORE 
format. It is linked to the project list through the sequence number 
stored in the HYDRO.KEY file. The other hydrology files contain direct 
references to districts and projects. In retrieving daily values, all 
parameter codes recorded at each station were included. Thus, the daily 
file, and the monthly, yearly, and grand summaries generated from it, 
contain some water quality information monitored by the USGS on a daily 
basis (e.g., temperature, conductivity, suspended solids). Parameter 
codes and coverage are indicated in Table 5. 

42. Monthly variations in reservoir discharge and elevation are 
needed in order to provide bases for calculating pool hydraulic resi¬ 
dence times and volume-averaged water quality conditions on a seasonal 
basis. Table 21 presents an inventory of reservoir discharge, elevation, 
and contents data monitored at USGS stations and contained in the 
HYDRO.MONTHLY file. Table 21 is organized by division. Corresponding 
inventories by reservoir and district are contained in Appendix A. Since 
the monthly hydrologic summary has been generated from the daily values 
file, these inventories also reflect daily data holdings. 



43. The files contain reservoir discharge data for 245 out of the 
299 projects in the central project list. Elevation and contents data 
are included for 44 and 108 projects, respectively. Regional deficiencies 
in elevation or contents data are particularly evident for the New England 
North Atlantic, South Atlantic, Ohio River, and Missouri River Divisions. 
These deficiencies need to be corrected, probably using district-level 
information sources, in order to provide a basis for model evaluations 
under Phase II of this project. 
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PART IX: WQ - WATER QUALITY FILES 


Introduction 


44., The sixth group of files, WQ, contains water quality infor¬ 
mation for CE projects., It is organized as follows: 

WQ.KEY - Station Key 

WQ.DESTAT - Detailed Station Descriptions 
WQ.OBS - Observations 

WQ.SUM - Data Summary by Station and Parameter 

WO.maps - Station Maps 

Three sources of water quality data have been used: (1) EPA's STORET 

3 17 

system ; (2) the INFONET system used by the Ohio River Division of the 
Corps of Engineers; and (3) miscellaneous survey data for specific 
projects. The numbers of stations and observations obtained from each 
source are listed in Table 22. The use of these sources and resulting 
file structures and contents are described in the following sections. 

STORET Data Acquisition and Processing 

45. As in Table 22, STORET is the primary source of water quality 

information. The following sources have been used to identify station 

codes and to associate them with specific CE projects: 

a_. the Master Water Data Index (MWDI) maintained by the 
National Water Data Exchange of the USGS^^. 

b. the USGS Catalogue of Information on Water Data^®'^. 

c. a list of stations included in the National Stream 
Quality Accounting Network (NASQAN) maintained by the 
USGS20. 

9 

d. the EPA National Eutrophication Survey Working Papers . 
and 

£, direct station identification retrievals from STORET 
using a latitude/longitude search technique. 

These sources are described below in the order used. 








Table 22 


Inventory of Water Quality Data by Source 


Sample 


Agency 


Stations 

Dates 

Observations 

STORET - 

EPA/NES 

1,637 

16,119 

181,173 

STORET - 

USGS 

655 

35,326 

592,853 

STORET - 

CE 

470 

34,890 

322,337 

STORET - 

States 

541 

18,243 

235,894 

STORET - 

Other 

357 

8,078 

154,266 

INFONET ■ 

- ORD 

763 

16,995 

534,412 

Miscellaneous 

28 


2,259 

TOTAL 


4,451 

129,821 

2,023,194 
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46. The Master Water Data Index (MWDI) documents water quality 
and quantity monitoring activities by various local, state, and federal 
agencies throughout the U. S. and contains information on site loca¬ 
tion, agency, dates, types of measurements, and data storage media. 

It does not contain measurements, but serves as a means of locating 
them. The MWDI registers all stations in the EPA STORET and USGS 
WATSTORE systems, in addition to information on monitoring activities 
by agencies which do not participate in other federal data banks. 

Thus, this file represents the most comprehensive one available for 
identifying monitoring sites and locating data. An initial list of 
monitoring stations associated with specific projects was derived by 
applying a latitude/longitude search technique to two large station 
files acquired on tape from the National Water Data Exchange. One 
contained all stations in the 'J. S. monitored by the Corps of Engineers, 
and the other contained all lake or reservoir stations monitored by any 
agency in the U. S. The station/project matching derived from this 
search was verified manually by checking station location descriptions 
and consulting maps, when needed. 

47. The station listings derived from the MWDI did not contain 
stations monitored by non-CE agencies on tributary streams. The second 
and third sources listed above provided additional tributary stations 
operated primarily by the USGS. NASQAN^*"* stations are particularly 
data-rich, having been operated by monthly frequencies since 1975 with 

a broad water quality parameter coverage. Forty-nine such stations have 
been located in or directly below the watersheds of projects in the 
central project list* 

48. The STORET station list also includes all stations operated 
by the EPA National Eutrophication Survey in 108 CE projects. These 
stations include upstream tributaries, point sources, reservoir stations, 
and reservoir discharge stations. Nutrient loading calculations for 
these projects will provide a basis for the testing loading models in 
Phase II. 






49. Finally, a series of station identification retrievals were 
done directly in STORET using a search technique based upon the latitude/ 
longitude polygons contained in the WATS.POLYS file. Because of cost 
and time considerations, this technique was applied only to projects 
which were in one of two categories: (1) sampled by the EPA National 
Eutrophication Survey (since these will be the primary focus of Phase II 
modelling efforts); or (2) without water quality data derived from 
other station searching techniques. An extracting option available 

in STORET was also used to identify only stations for which total phos¬ 
phorus measurements were available. 

50. Experience with these alternative station searching techniques 
indicates that no one method is completely satisfactory. Each relies upon 
the accuracy of the station characteristics and coordinates entered in 
the STORET file. Polygon search techniques often retrieve extraneous 
stations or miss relevant stations because of inaccurate latitude/longi¬ 
tude entries in the STORET station file. Similarly, retrievals which 
depend upon station types (e.g., "stream" vs. "lake”) will miss stations 
which have been inaccurately classified* For example, many stations 
located in reservoir pools (based upon location descriptions and/or 
coordinates) were classified as "stream" stations in STORET and MWDI* 

The variety of methods employed to identify stations has helped to pro¬ 
vide reasonable project coverage. All station/project matchings in the 
final STORET station list have been checked manually with reference to 
verbal station location descriptions and maps. 

51. Preliminary STORET retrievals have been used to screen out 
sites with little or no relevant information and to verify station codes. 
Results have been obtained in the STORET Inventory format, which lists 
station descriptions and statistical summaries of water quality components 
monitored. Based upon these inventories, most stations with only one 
sampling date have been eliminated from the station list. 

52. Following the screening procedure, a second series of STORET 
retrievals has been used to obtain copies of the data on tape for all 
observations made after 1964. The most comprehensive retrieval format 


62 







available from STORET has been used. This format, termed "M0RE=5", 
provides complete station descriptions along with observations of up 
to 50 different water quality variables at each station. 

53. A total of 100 variables have been selected for inclusion in 
the data base, as listed in Table 5. This necessitated two retrievals 
for each station. The selection of variables is based upon the objec¬ 
tives of the project and upon the results of the preliminary station 
inventories, which gave initial indications of data availability as a 
function of parameter code. The list contains some redundancies due 

to multiple ways of expressing various types of measurements (e.g., 
temperature as degrees F or degrees C or phosphorus as P or PO^). Con¬ 
version routines have been used bo eliminate these redundancies in tape 
processing. 

54. In a final step, the STORET tapes have been processed to 
generate one file containing station descriptions (WQ.DESTAT) and another 
containing water quality observations (WQ.OBS). This involved several 
sort/merge steps to combine data from the individual STORET tapes in a 
sequenced form. Overall, STORET has provided 1,486,523 observations at 
3,660 stations. 


INFONET Data Acquisition and Processing 

55. The INFONET 17 system used by the Ohio River Division to manage 
water quality data has been accessed as a second source of information. 
Five tapes have been obtained from ORD, one containing station descrip¬ 
tions and the other four containing water quality observations for each 
of the four ORD districts (Pittsburgh, Huntington, Nashville, and Louis¬ 
ville) . Because the organization and formats of the INFONET tapes are 
different from those obtained from STORET, a different set of programs 
has been written and employed to extract the data and process it into a 
form suitable for merging with output from the STORET tape processing. 

56. A systematic procedure has been used in extracting data from 
the ORD tapes. In the first step, stations of interest have been selected 
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from the ORD station tape based upon ORD project identification codes 
and used to generate a station description file keyed to members of the 
central project list. A list of primary and secondary ORD station codes 
has been extracted from the station description tape. In the final step, 
the station code and parameter code files have been used to extract 
relevant observations from the ORD data tapes. This process has been 
repeated for each district and the resulting files have been merged. 

A total of 763 stations and 534,412 observations have been derived from 
this data source. 

Miscellaneous Data Acquisition and Processing 

57. Water quality data acquired from STORET and INFONET have been 

supplemented with miscellaneous data which has been manually coded and 

entered directly into the water quality files. This has been done to 

improve the regional coverage of the water quality data base. This 

relatively time-consuming approach has been limited to two sources: 

21 

(1) survey data obtained from Baltimore District for Almond, Whitney 

Point, and Alvin R. Bush Reservoirs; and (2) survey data obtained from 

22 

the North Central Division for Eau Galle Reservoir and Lac Qui Parle. 

58. Water quality data and station descriptions from these sources 
have been coded at WES. Keypunching and verification have been done at 
MIT. The resulting files containing 28 stations and 2259 observations 
have been merged with data from STORET and INFONET in the formats des¬ 
cribed below. 


WQ File Structures 

59. The water quality data base consists of four files (WQ.KEY, 
WQ.DESTAT, WQ.OBS, WQ.SUM) and a set of station maps (WQ.maps). The 
formats of the four files are given in Tables 23 through 26, respectively. 
The structure and contents of each are discussed below. 
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Table 24 (Concluded) 





























































60. WQ.KEY (Table 23) contains station source and location des¬ 
criptors and accounting information on the amount of data in the WQ.OBS 
file, including the number of observations, number of sampling dates, 
date range, and depth range. Station descriptors have been derived from 
the WQ.DESTAT file, and accounting information from the WQ.OBS file. 

Each station has been given a unique, 8-digit identifying code. The 
first two digits represent CE district (Table 1) and the next three 
represent CE project (Table 8). The last three contain a code which is 
unique within each project. The following conventions have been used 
in assigning the last three digits of the station code: 

001 - 100 STORET stations retrieved in March of 1979 
101 - 200 STORET stations retrieved in March of 1980 
301 - 400 EPA National Eutrophication Survey stations 
501 - 600 INFONET stations (Ohio River Division) 

801 - 900 stations entered manually 


Station numbers have been assigned sequentially within each category and 
project, after sorting the stations by STORET agency and STORET station 
codes. The station coding scheme permits sorting and analysis by 
station, district, project, station, and/or data source. The WQ.KEY 
file contains 4451 records (one per station), sorted by the 8-digit 
station code. 

61. WQ.DESTAT contains detailed information on station location 
and data source. As shown in Table 24, it contains four record types. 

The fourth type is repetitive and contains up to 15 lines of detailed 
descriptive text on each station. WQ.DESTAT contains about 31,000 
records, sorted by the first ten digits of each record (station code/ 
record sequence number). 

62. WQ.OBS (Table 25) contains water quality observations. Each 
record is identified by station, date, time, depth, and parameter code. 
Standard STORET remark codes identify measurements which are less than or 
greater than indicated values or duplicate values. The last part of 
each record contains composite sample information. WQ.OBS, which con¬ 
tains 2,023,194 records, sorted by the first 24 columns (station/date/ 
time/depth/parameter), is stored on tape (sequential access only). 












63. WQ.SUM (Table 26) contains a water quality data summary by 
station and parameter, derived from analysis of the WQ.OBS file. Statis¬ 
tics include date range, depth range, value range, mean, standard devia¬ 
tion, and value percentiles (25%, 50%, and 75%). Summary statistics are 
derived from the first 1000 observations for each station/parameter combi¬ 
nation. Each record represents one station/parameter combination and the 
file contains about 75,000 records sorted by station and parameter codes. 

64. To provide direct access to water quality station descriptions 
and data summaries, the contents of the WQ.DESTAT and WQ.SUM file have 
been produced in microfiche form. Frame format is illustrated in 

Table 27. The heading of each frame contains the district, project, 
station, and station type codes and names. Station descriptions are 
entered from the WQ.DESTAT file. The data summary by parameter follows. 
Frames are sorted by district, project, and station codes. A new fiche 
card is begun with each district. Card labels indicate the district and 
project described in the first frame. The last frame of each card 
contains an index which lists the project, station, and associated 
frame coordinates. 

65. WQ.maps is a collection of station maps (one per project) 
which have been produced on a Calcomp line plotter using information in 
the WQ.KEY file. An example is given in Figure 4 which, can be compared 
with the watershed map in Figure 3. Stations are located based upon 
latitude/longitude coordinates. Different plot symbols are used to 
identify station types. Only stations with more than 10 observations 
are plotted. Adjustments in horizontal and vertical scales are made 
for each map so that a linear distance scale is preserved and the map 
fits within an 8.5 x 11 inch area. A scale factor in miles per inch is 
derived from the LISTS.CPL file and plotted with a triangle. These maps 
are useful for identifying station locations and for refining the station 
type codes. They are subject, however, to errors in the station coordi¬ 
nates derived from STORET or INFONET. Based upon the maps and station 
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descriptions, some editing of obvious coding errors in station coordi¬ 
nates has been possible. 


WQ Data Inventories 

66. Table 28 presents an inventory of water quality data by station 
type and division. Corresponding inventories by project and district 

are given in Appendix A. Overall, 271 out of 299 projects are represented 
in the water quality files. Remaining regional deficiencies include 
St. Paul District (6 out of 13 projects), Portland District {9 out of 17 
projects), and Los Angeles District (0 out of 2 projects). 

67. An inventory by station type and parameter code is given in 
Table 29. As expected, temperature, pH, and oxygen are the most fre¬ 
quently represented parameter codes in the file. No data were located 
for one code (09—average daily spillway flow). 

68. Phosphorus, chlorophyll-a, and Secchi depth data are parti¬ 
cularly relevant to assessing eutrophication problems and therefore to 
Phase II modelling efforts. Table 30 presents an inventory of these 
measurements made at pool stations (type codes 2, 4, or 5) by division. 
Corresponding inventories by project and district are given in Appendix 
A. Out of 299 projects in the central project list, total phosphorus 
data have been located for 211, chlorophyll-a data for 132, and Secchi 
depth data for 171. Some regional deficiencies in total phosphorus data 
are evident particularly in the New England Division (12 out of 22 
projects). North Atlantic Division (6 out of 15 projec’s), North Central 
Division (6 out of 16 projects), and North Pacifi~ D ion (6 out of 

27 projects) . These are also reflected in the ^.•ropi. i-a and Secchi 
depth inventories. These inventories indicate that the data base will 
not be sufficient to assess the trophic state of all CE projects with 
appreciable pools. The complete coverage for roughly 130 projects 
indicates, however, that the data base should be generally sufficient 
for model testing purposes, the primary objective of Phase II. Regional 
testing of models should also be possible, with a few exceptions (e.g., 
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the lack of chlorophyll-a or Secchi depth data for projects in New 
England). 

69. The EPA National Eutrophication Survey is a primary source 
of data for testing nutrient loading models and relationships among 
within-pool measures of trophic state. An inventory of data holdings 
by agency, station type, and parameter code is given in Table 31. Two 
agency groupings are used: "EPA", representing the National Eutrophi¬ 
cation Survey; and "OTHER", representing all other agencies and monitoring 
programs. Station types include tributary (type code 1), discharges 
(type code 3), and pool (type codes 2, 4, and 5). The parameter list 
includes a variety of nutrient, biological, chemical, and optical 
characteristics pertinent to eutrophication analysis. Within each 
category, the numbers of observations, sampling dates, stations, projects, 
and districts are indicated, along with the total period of record in 
station-months. Including data from agencies other than the EPA/NES has 
more than doubled the total numbers of observations of most parameters 
and provided additional useful measurements for some projects, such as 
algal cell numbers and volumes, turbidity, and suspended solids. At 

pool stations, non-EPA agencies provide more chlorophyil-a observations 
(2036 vs. 1499) concentrated in fewer projects (43 vs. 108). These and 
other statistics in Table 31 reflect the relative intensities of .i-vency 
monitoring efforts—many of the non-EPA programs are more intensive 
and extensive temporally than the three-date, one-growing-season program 
employed by the EPA/NES. 

70. Based upon data inventories in Appendix A, some projects for 
which non-EPA data monitoring of trophic state indicators has been 
partic ly intense include the following: 

06-372 John H. Kerr 

15- 399 Eau Galle 

16- 311 East Branch Clarion River 

18-093 Monroe 

18- 120 Barren River 

19- 340 J. Percy Priest 

26-359 Sam Rayburn 

29-111 Pomona 

32-204 Kookanusa 


79 






Table 31 


Inventory of Eutrophication-Related Water Quality Components 
by Station Type and Monitoring Agency 


TYPE: 




_TR I B_ 



_D S C H_ 



PPOl 


AGENCY: 


EPA 

OTHR 

ALL 

EPA 

QTHR 

ALL 

EPA 

OTHR 

ALL 

TOTAL 

P 

NOBS_ 

_9794_ 

.23.496, 

_33290_ 

1450 

J_2475_ 

_13925 _ 

_6689_ 

15145 21834 

TOTAL 

P 

NDAT 

9771 _ 

22370 

32141 

1428 

1 1895 

13323 

1410 

78S9 

9307 

total 

P 

NSTA 

806 

997 

1803 

1 1 6 

416 

532 

479 

737 

1216 

total 

P 

_N PRJ_ 

109 

214 

_236 

_ioa_ 

21 1 

_237 

1 08 

1 88 

71 1 

total 

P 

NOIS 

27 

30 

31 

27 

29 

30 

27 

28 

29 

total 

P 

mths 

8469 

35292 

43761 

1230 

21050 

22280 

2628 

18388 

21016 


ORTHO-P 

NOBS 

9879 

12198 

22076 

1 456 

5433 

6889 

6668 

1 2851 

19=19 

ortho-p 

NDAT 

9855 

1 0386 

20741 

1 434 

5063 

6497 

14 18 

5931 

7349 

ortho-p 

_N ST A 

808 

726 

1 534 

_115 

31 1 

427 

479 

=76 

mss 

ortho-p 

N PR J 

1 09 

186 

223 

108 

193 

225 

1 oa 

167 

205 

ortho-p 

NOIS 

27 

29 

31 

27 

28 

30 

27 

26 

29 

ortho-p 

MTHS 

8459.15528.24087 

1218. 8670 9888 2628 

1 1 6 64 

1 4 1 R7 

NH3-N 

NOBS 

1 0002 

14010 

24012 

1 464 

6393 

7857 

6632 

1 2294 

1 8966 

NM3-N 

NDAT 

Q 979 .13380 _2 3359 

_1442_ 

=264 . 7706 

_L4 1 8 

= 078 

7396 

NH3-N 

NSTA 

308 

829 

1637 

116 

343 

459 

479 

639 

1118 

NH3-N 

N PRO 

1 09 

197 

221 

108 

198 

226 

1 08 

169 

202 

NH3-N 

NDIS 

_27 

_29. 

3Q 

27 

28L 

30 

27 

76 

79 

NH3-N 

MTHS 

8508 

21659 

30167 

1231 

1 1 633 

12864 

2628 

14074 

16702 

TKN 

NOBS 

Q 970 11554. 21 524._ 

1461_ 

= 325 

_6186 

_=:75_ 


1 7535...- 

TKN 

NDAT 

9953 

1 1032 

20985 

1439 

5211 

6650 

1 39i 

4764 

6145 

TKN 

NSTA 

304 

777 

1581 

1 16 

276 

392 

470 

= 25 

995 

TKN 

N PRO 

109 

_ 175. 

_206_ 

_LQg_ 

1 73 

71 1 

_1C5_ 


_ 192 _ 

TKN 

NDIS 

27 

29 

31 

27 

26 

30 

26 

24 

28 

TKN 

MTHS 

8505 

18057 

26562 

1230 

8830 

10060 

2599 

1 0869 

1 3468 

N02*3-N 

NOBS 

9999 

12425 

22424 

1 464 

6000 

7464 

6632 

1 1 385 

1 8067 

N02+3-N 

NDAT 

9975 

1 1907 

21883 

1 442 

5906 

7348 

1418 

53C2 

6720 

N02-3-N NSTA 

80 7.- 

741 

1546 

1 1 6 

276 

2°2 

4?^ 

C 15_ 

234 _ 

NC2+3-N 

N PR J 

1 09 

165 

1 94 

108 

154 

188 

1 08 

1 29 

167 

N02+3-N 

NDIS 

27 

28 

29 

27 

23 

29 

27 

24 

28 

N02+3rfi _ 

_M JHS_ 

_8510 

17657 25177 

1231 

9360 .10591 

7^2° 

1 ? * Jfi 

147?4_ 

N03-N 

NOBS 

5979 

17079 

23057 

784 

10629 

11413 

0 

6553 

6563 

NC3-N 

J3DAI_ 

_5268_ 

16133 22101 

_za_i_ 

QQ 87 10768 . 

0 


4,7 7 g_ 

N03-N 

NSTA 

773 

526 

1299 

111 

274 

385 

0 

409 

409 

N03-N 

N PRJ 

106 

173 

208 

107 

168 

21 1 

0 

153 

153 

N03-N 

_NDLS _ 

28 

28 

30 

_ 25 

26 

29 

0_ 



N03-N 

MTHS 

4904 

20780 

25684 

617 

13582 

14199 

0 

7=88 

7 = 38 

DHL-A 

NOBS 

3 

_743_ 

746 

6 

318 

->24 

14 = 9 



CHL-A 

NDAT 

3 

728 

731 

6 

313 

319 

1 490 

1 129 

2619 

CHL-A 

NSTA 

1 

74 

75 

2 

30 

32 

482 

1 47 

629 

CHL-A 

N PRJ 

1- 

29 

TO 

t 

19 

19 

1 Qfi 

4_^ 

1 32 

chl-a 

NDIS 

1 

14 

15 

! 

12 

1 2 

27 

13 

28 

CHL-A 

MTHS 

5 

11 89 

1 194 

8 

377 

385 

2660 

3145 

= 805 

ALGAE <#) 

NOBS 

0 

1058 

1058 

0 

875 

875 

0 

1076 

1076 

ALGAE(») 

NDAT 

0 

1053 

1053 

0 

875 

875 

0 

907 

907 

ALGAE! 

NSTA 

o 

48 

48 

Q 

32 

32 

0 


1 'J'l 

ALGAE(A) 

N PRJ 

0 

26 

26 

0 

29 

29 

0 

59 

59 

algae(*) 

NDIS 

0 

12 

12 

0 

14 

14 

0 

12 

12 


... M-THS_ 

Q 



_Q_ 

WnvvM 


_0. 

1437 

1432— 


(Continued) 


80 














Table 31 (Concluded) 


Tvpg; 
AGENCY! 

AtGAE(V) 

_NOBS_ 

EPA 

0 

TRI8 

OTHR 

7 

ALL 

7 

EPA 

_g_ 

DSCH 

OTHR 

0 

ALL 

0 

EPA 

0 

POOL 

OTHR 

_344_ 

ALL 

_444_ 

A LoA£(y) 

NOAT 

0 

7 

7 

0 

0 

0 

0 

430 

430 

ALGAE(V) 

NSTA 

0 

1 

1 

0 

0 

0 

0 

40 

40 

AlGAEtV] 

N PR J 

0 

1 

1 

0 

0 

0 

0 

14 

14 

A LGAE ( V )~ 

NOIS 

0 

1 

1 

0 

0 

0 

0 

1 

1 

A LGAE(V) 

MTHS 

0 

6 

6 

0 

0 

0 

0 

832 

832 

""SECCHI 

NOBS 

3 

11 81 

1 184 

6 


426 

1487 

5091 

6578 

SECCHt 

NOAT 

3 

1152 

1155 

6 

349 

355 

1478 

4969 

6447 

SECCHI 

NSTA 

1 

172 

173 

2 

35 

37 

401 

406 

977 

SECCHI 

NPRJ 

1 

42 

43 

1 

28 

29 

108 

121 

171 

SECCHI 

NOIS 

1 

14 

14 

1 

12 

12 

27 

20 

28 

SECCHI 

MTHS 

5 

2137 

2142 

8 

597 

605 

2629 

1 2314 

1 4043 

TRANS(X) 

NOBS 

s 

40 

45 

24 

12 

36 

6781 

2556 

9337 

transixl 

NOAT 

2 

23 

25 

6 

9 

15 

1419 

368 

1787 

TRANS(X) 

NSTA 

1 

18 

19 

2 

8 

10 

481 

96 

577 

TRANS(X) 

NPRJ 

1 

11 

12 

1 

8 

9 

1 08 

21 

121 

TRANS ( X {_ 

_N 0 I S_ 

t 

2 

3 

1 

1 

2 

_27 

4 

27 

TRANS(X) 

MTHS 

3 

6 

9 

8 

10 

18 

2484 

816 

3300 

lIGHTfX) N CBS 

0 

66 

66 

0 

60 

60 

486 

3467 

3953 

LIGHT(X) 

NOAT 

0 

21 

21 

0 

24 

24 

329 

531 

860 

UOHUKI 

NSTA 

0 

16 

16 

0 

12 

12 

203 

201 

404 

LIGhT(X) 

NPRJ 

0 

10 

10 

0 

12 

12 

52 

42 

91 

i-ICHT (X) 

NOIS' 

0 

1 

1 

0 

3 

3 

12 

7 

18 

LIGHT(X) 

MTHS 

0 

7 

7 

0 

17 

17 

335 

494 

829 

— turbioTt' 

NOBS 

0 

24440 

24440 

0 

14708 

14708 

0 

22258 

22258 

T'JRSIOIT 

NOAT 

0 

23815 

23815 

0 

14555 

14555 

0 

1 0580 

l 0580 

TU90IOIT_ 

NSTA 

0 

_90 4_ 

904 

0 

_350_ 

350_ 

0 

762 

_T52_ 

TUflBIOIT 

NPRJ 

0 

201 

201 

0 

i 99 

1 99 

0 

1 71 

1 7 1 

TUR0IOIT 

NOIS 

0 

30 

30 

0 

28 

28 

0 

25 

25 

TUR8I0IT 

MTHS 

0 

_24945_ 

24045 

0 

1 2907 

1 2907 

0 

19675 

19675 

SUSP SOL 

NOBS 

0 

12054 

12054 

0 

5766 

5766 

0 

10669 

1 0669 

SUSP SOL 

NOAT_ 

0 

1 1 470 

1 1 470 

0 

5672 

5672 

0 

4234 

.12 34 

SUSP SOL 

NSTA 

0 

686 

686 

0 

264 

264 

0 

429 

429 

SUSP SOL 

NPRJ 

0 

1 68 

168 

0 

169 

169 

0 

1 16 

1 16 

SUSP SOL NOIS 

0 

_29 

_29_ 

_Q_ 

26 

26_ 

0 

2X 

21 

'SUSP SQL 

MTHS 

0 

18422 

1 8422 

0 

8277 

8277 

0 

8401 

8401 


OXYGEN_NOBS_8 4 2216_42224_1 8_292 L6_29234_ 

OXYGEN NDAT 3 33919 33922 5 25106 26112 

OXYGEN NSTA 1 1062 1063 2 412 414 

Oxygen_n prj _t_2.' 2_212_t_21 1 _217. 

OXYGEN NOIS 1 30 30 1 29 29 

OXYGEN M THS 5 40323 40328 8 22191 22199 


FLO* 

FLO* 

FLO* 

N03S 

NOAT 

NSTA 

7934 
7933 
_5.90 

24977 
23692 
_S33_ 

32911 

31625 

1123 

1447 
1447 
_109- 

15753 

15173 

268 

17200 

16620 

376 

0 

0 

0 

1468 

1373 

03 

1468 

1373 

93 

FLOW 

NPRJ 

106 

156 

185 

106 

175 

210 

0 

35 

35 

FLO* 

NDIS 

27 

30 

31 

27 

25 

30 

0 

17 

17 

FLO* 

MTHS 

■-LkLJ 


2B482 

■Sul 

KekEFI 

IIIHP 

0 


IBD 


_£ 202 _! 
1494 
4S2 
_L0 6 
27 

2668 


17465 18959 
1051 1533 


29 

31533 


29 

34201 










Thus, it will not be necessary in Phase II to rely exclusively upon 
data from the EPA/NES for assessing relationships among within-pool 
measures of trophic state. The stringent water quality and flow sampling 
program requirements required for estimation of nutrient budgets and 
time limitations of this project suggest, however, that EPA/NES data be 
used exclusively for evaluating nutrient loading models. 









PART X: SED - SEDIMENTATION DATA 


i 


71. The seventh major file group describes sedimentation charac¬ 
teristics of CE reservoirs. It contains the following elements: 

SED.sheets - Sedimentation Survey Sheets 
SED.RATES - Sedimentation Rate File 

This information has been derived from a collection of sediment survey 
data for U. S. reservoirs compiled in 1975 by the Agricultural Research 
Service of the U. S. Department of Agriculture^' A total of 84 CE 

projects in the central project list were included in that compilation. 
These are identified by the sedimentation survey key in the LISTS.CPL 
file (Table 8). 

72. SED.sheets consists of a collection of the most recent sedi¬ 
mentation survey sheets contained in the appendix to the U. S. Department 
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of Agriculture (USDA) compilation . These sheets contain detailed in¬ 
formation on project location, morphometry, hydrology, as well as sedi¬ 
mentation. An example is given in Table 32. Sheets have been assembled 
in a loose-leaf notebook, identified, and arranged by district and proj- 
ject code. 

73. SED.RATES is a file containing the most recent estimates of 
sedimentation rates for each of the 84 projects located in the USDA com¬ 
pilation^ 0 . The format of this file is given in Table 33. Many of the 
sedimentation rate measurements antedate the water quality file. For 45 
projects, the most recent survey data available were taken during or 
before 1965. Rate estimates for some projects, however, will be useful 
for testing relationships between sedimentation rate and nutrient trapping 
efficiency during Phase II of this study. 




i. 
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Table 32 (Concluded) 
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PART XI: NES - EPA NATIONAL EUTROPHICATION SURVEY DATA 


74. The eighth file group consists of data and reports obtained 

4 9 

from the EPA National Eutrophication Survey ' , exclusive of the water 
quality, hydrologic, and drainage area data described in previous sec¬ 
tions. It consists of the following elements: 

NES.reports - EPA/NES Working Papers 

NES.COMP - Compendium of Lake and Reservoir Data 

NES.SUM - Compendium Summary 

This file group contains information on 108 CE projects as well as the 
other 704 lakes and reservoirs which were sampled by the EPA/NES through¬ 
out the U. S. Besides providing detailed information on point-source 
nutrient loadings to CE projects needed for nutrient budget estimation 
in Phase II, it provides a basis for comparisons of lakes and reservoirs 
with respect to morphometric, hydrologic, and nutrient loading response 
characteristics. Such comparisons, in turn, can provide means of inter¬ 
preting the performance of lake nutrient loading models in reservoirs. 

75. During 1972-75, the EPA conducted a systematic survey of 812 
lakes and reservoirs in the U. S. The original objective of the National 
Eutrophication Survey was to develop a data base for assessing the 
impacts of point-source nutrient discharges on the trophic conditions 

9 

of lakes and reservoirs. The NES produced a series of working papers 

summarizing and interpreting the results for each impoundment or lake 

monitored. Results have also been summarized in a compendium format and 

23 

published in four volumes 

76. NES.reports consists of the collection of working papers des¬ 
cribing NES surveys and results in each of 108 CE projects. Each report 
has been referenced to the data base by district and reservoir code. 

Maps extracted from the reports have been included in the WATS.maps file. 

77. NES.COMP is a computer file containing a nationwide summary 
of NES data for 775 lakes and reservoirs, obtained from the Corvallis 

4 

laboratory of the EPA . A sample printout of data from the file is 
given in Table 34. Similar printouts have been produced for all CE 
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projects in the compendium file and arranged in a notebook by district 
and project code. The compendium contains data for 106 out of 108 CE 
projects sampled by the EPA/NES. 

78. NES.SUM is a summary of the NES.COMP file, with a few modifi¬ 
cations. The format of the summary file (Table 35) is more convenient 
for input to statistical analysis programs, since one record is used 
for each lake or reservoir. A number of missing latitude and longitude 

coordinates have been added, based upon estimates derived from USGS 

£ 

Hydrologic Unit Maps . Each lake or reservoir has also been referenced 
by CE division to provide a basis for regional contrasts of lake and 
reservoir characteristics. 

79. A few simple analyses have been done to provide descriptions 
of types and amounts of data contained in the NES.SUM file. The regional 
distribution of lakes and reservoirs described in the file is depicted 

in Figure 5. A breakdown of impoundment numbers as a function of trophic 
state, impoundment type, and CE division is given in Table 36. The 
numbers of CE projects are listed as a function of division and trophic 
state in Table 37. 

80. The results of preliminary lake/reservoir comparisons which 
have been made using data from the NES compendium tape are summarized 
in Table 38. Three groups of impoundments have been compared: natural 
lakes (N=310); non-CE reservoirs (N=359); and CE reservoirs (N=106). 

The stratification of thirty-six original and conposite variables across 

these groups has been studied using the BMDP-77 conputer program 1 . 

Within-group means and Analysis of Variance (ANOVA) statistics are 

listed in Table 38. Detailed results with histograms are presented in 
.24 

a previous report . 

81. While the scope of this report precludes detailed interpreta¬ 
tions of the data, a few comments on these results seem appropriate: 

a. At the 95% significance level, differences across groups 
are evident in all cases except longitude, conductivity, 
phosphorus retention coefficient, nitrogen retention 
coefficient, and the second Vollenweider phosphorus 
loading statistic 25 . 
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b. Strongest differences across groups are apparent for 
drainage area (F=77.5), drainage area/surface area 
(F=73.6), outflow rate (F=60.7), and drainage area/ 

* maximum depth (F=53.4). 

c. Reservoirs have higher potential phosphorus concentra¬ 
tions, based upon phosphorus loading and the Vollenweider 
statistics, but lower observed phosphorus and chlorophyll-a 
concentrations than natural lakes. This is an indication 
that the Vollenweider statistics may give biased predic¬ 
tions in reservoirs (i.e., over-predict loading impact). 

d. Reservoirs have less transparency, despite less chloro¬ 
phyll-a. This is probably an effect of mineral turbidity 
in reservoirs. 

e. N/P loading ratios are somewhat lower for reservoirs than 
for natural lakes; the reverse is true, however, for 
inorganic N/dissolved P ratios measured during the summer 
within the impoundments. 

_f. The composite statistic (Secchi depth/mean depth) is 
strongly stratified across groups (F=44.8). Assuming 
that the depth of the photic zone is roughly twice the 
Secchi depth, the means in Table 38 indicate that typi¬ 
cally 62% of the volume of lakes is available for 
photosynthesis, compared with 30% in the case of CE 
reservoirs. Reservoirs probably have greater shoreline 
development, however, and may be more influenced by photo¬ 
synthetic processes occurring in littoral zones. 

g. While a 4-degree difference is evident in the average 
latitude of CE reservoirs as compared with lakes, a 
detailed look at Figure 5 reveals that the distribution 
of NES lakes is bimodal, with clusters in the North 
(primarily Minnesota, Wisconsin, Michigan, and Maine) 
and in the extreme South (Florida). The maximum reser¬ 
voir density occurs between these extremes. 

Because of the strong regionality of the lake vs. reservoir distributions 
evident in Figure 5, it is difficult to separate the effects of impound¬ 
ment type from those of regior with analyses of the type described above. 
Thus, in future, more complete analyses, it will be important to control 
for regional differences by comparing subsets of lakes and reservoirs 
within defined regions. 
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OTHER = combination of two or more trophic states 
GRAND TOTAL = number of impoundments in LISTS.CPL file 



Table 38 


Summary of Lake/Reservoir Comparisons Derived from EPA/NES Compendium 

WITHIN-GROUP MEANS ANOVA RESULTS 

(N-309) (N-360) (N-106) 


* 

c 

~ 

Variable 

Nat. 

Lakes 

Non-CE 

Reserv. 

CE 

Reserv. 

F 

Prob 

(greater F) 


Latitude (degrees N) 

41.6 

39.0 

37.6 

34.8 

< .0001 

A 

Longitude (degrees W) 

91.7 

93.6 

92.3 

1.9 

.16 

G 

Drainage Area dan 2 ) 

222 . 

1358 . 

3228 . 

77.5 

<.0001 

G 

Surface Area Oca 2 ) 

5.6 

3.6 

34.5 

44.4 

N 

G 

Volume (km 2 m) 

27.3 

50.8 

239 . 

36.4 

I* 

G 

Mean Depth ( a ) 

4.5 

5.7 

6.9 

10.4 

M 

G 

Maximum Depth (m) 

10.7 

15.8 

19.8 

18.8 

ft 

G 

Hydraulic Res. Time (yr) 

.74 

.23 

.37 

23.5 

ft 

G 

Overflow Rate (m/yr) 

6.5 

25 . 

19 . 

40.9 

ft 

G 

Outflow Rate (km 2 m/yr) 

47.9 

236 . 

650 . 

60.7 

•• 

G 

Dr. Area/ Surf. Area 

33 . 

156 . 

93 . 

73.6 

•• 

G 

Dr. Area/Volume(1/m) 

6.6 

26 . 

13 . 

39.5 

ft 

G 

Maximum Depth/Mean Depth 

2 . 5 

2.8 

2.9 

7.3 

.0008 

G 

Relative Depth 

.40 

.47 

.30 

8.6 

.0002 

G 

Surf A/ Max Depth 

.53 

.56 

1.7 

24.6 

< .0001 

G 

Dr. Area/Max Depth 

20 . 

86 . 

162 . 

53.4 


A 

pH 

8.06 

7.74 

7.63 

24.1 

<.0001 

G 

Alkalinity (mg/1) 

87 . 

63 . 

6 $. 

8.6 

.0002 

G 

Conductivity (UMhos/cm) 

253 . 

214 . 

208 . 

2.4 

.0950 

G 

Total Phosphorus (mg/1) 

.054 

.053 

.039 

3.6 

.0268 

G 

Dissolved P (mg/1) 

.021 

.018 

.011 

9.5 

<.0001 

G 

Inorganic Nitrogen (mg/1) 

.20 

.26 

.30 

9.7 

<.0001 

G 

Secchi Depth (m) 

1.4 

1.2 

1.1 

7.7 

.0005 

G 

Chlorophyll-a (mg/1) 

14 . 

10 . 

8.9 

11.4 

<.0001 

G 

Algal Assay Yld (mg/1) 

2.5 

2.2 

1.3 

4.0 

.0194 

G 

2 

P Loading (g/» -yr) 

.87 

2.9 

1.7 

29.1 

< .0001 

A 

P Retention Coef. 

.36 

.35 

.40 

0.7 

.51 

A 

NPS Load Fraction - P 

.72 

.80 

.81 

8.4 

.0002 

G 

N Loading (g/m 2 -yr) 

IS. 

45 . 

28 . 

22.6 

< .0001 

A 

N Retention Coef. 

.24 

.20 

.17 

2.5 

.081 

A 

NPS load Fraction - N 

.87 

.93 

.96 

18.7 

<.0001 

G 

N Load/P Load 

20 . 

16 . 

16 . 

8.0 

.0004 

G 

Inorg N/ Diss. P 

9.1 

14 . 

26 . 

29.4 

< . 0001 

G 

♦Vollenweider Stat. 1 

.31 

.56 

.42 

15.2 

It 

G 

+Vollenweider Stat. 2 

056 

.067 

.057 

1.9 

.1586 

G 

Secchi D/ Mean Depth 

.31 

.20 

.15 

44.3 

<.0001 


* C = Code for Type of Mean (A - Arithmetic, G - Geometric) 

+ Statistics for Assessing Phosphorus Loading Impacts on Lake Eutrophication; 
Stat.1 - Lp/e s ' 5 -(ref.25); Stat.2 - (Lp/Q a )/{1 + /r) - (ref.26) 







PART XII: NUMERICAL CHARACTERIZATION OF RESERVOIR HYPSOGRAPHIC CURVES 


Introduction 


82. Estimates of reservoir volume and area variations with eleva¬ 
tion are required for computing volume-averaged concentrations and pool 
hydraulic residence times. This information is generally available in 
the form of a table which lists area at and volume below specific eleva¬ 
tions. Morphometry tables have been compiled for most of the 303 CE 
projects in the data base. This paper describes numerical investigations 
of these tables, specifically covering the relative accuracies of various 
curve-fitting and interpolation techniques. Results will be used to 
design methods for summarizing, storing, and applying morphometric 
information in other phases of the project. Results are also relevant 

to other aspects of reservoir management, specifically including the 
estimation of reservoir volumes and sedimentation rates from range survey 
or contour area data. 

83. This study has been performed prior to completion of the 
project morphometry file. A subset of data has been selected which 
covers 147 projects, each with at least five listed elevations and with¬ 
out obvious errors (such as decreasing areas or volumes with increasing 
elevation). A total of 1285 elevation/area/volume data points have been 
compiled for these 147 projects from various sources. Results of this 

study are summarized below. Details can be found in a working paper 

2 6 

submitted as a progress report under this contract 

Approach 

84. The objective is to design and evaluate methods for estimating 
area and volume at any elevation, giver information available for specific 
elevations. Methods can generally be classified as curve-fitting or 
interpolating. Accuracies of these methods can be assessed by comparing 
observations and predictions of the following: 








a. V(V) = volume estimated from volume/elevation points 

b. A(V) = area estimated from volume/elevation points 

c_. A (A) = area estimated from area/elevation points 

d. AV(A) = incremental volume estimated from area/elevation 
points 

Statistics a and c are simple curve-fitting or interpolating problems. 
Statistic b involves differentiation of the volume curve to estimate area 


~ 3V 

A(V) = || (1) 

where 

Z is the total depth, ft 

Statistic d involves integration of the area curve between specific 
depth limits (Z^ and Z^) to estimate incremental volume: 


AV (A) 



( 2 ) 


27 

85. In testing each method, a jackknife procedure has been used 
to estimate the above statistics for internal elevation points (i.e., 
tests are not made for the top and bottom listed elevations in each 
project, although these elevations are used in the fitting or inter¬ 
polating process). In applying the jackknife, information from the 
point being tested is not used in estimating the parameters of the pre¬ 
dictive function. For instance, estimates of area or volume at the 
second listed elevation are derived exclusively from information in the 
first and third through last listed elevations. 

86. Reported values and predictions are compared using the follow¬ 
ing error statistic: 


D = 



( 3 ) 


where 

Y = reported value (V, A, or AV) 

Y = estimated value 
D = error statistic 
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For errors of the magnitude studied here, the value of D approaches the 
difference between the reported and predicted values, expressed as a 
fraction of their average. In most cases, the D statistic seems to have 
reasonably symmetric error distributions. For each of the four types 
of area/volume comparisons, estimates of bias and standard error in D 
have been derived for a variety of predictive methods, as described below. 


Curve-Fitting Schemes 


87. A single-term power function has some useful properties for 
fitting these types of curves and is the simplest of the methods evalu¬ 
ated: 


V = 


A = 



(4) 

(5) 


Z = E- E 

o 


( 6 ) 


where 

c , c , b b * enpirical parameters 
v a v a 

Z = total depth (ft) 

E - elevation (ft, msl) 

E = elevation at V=A=0 (ft, msl) 
o 

Model parameters can be estimated using linear regression by transforming 
V, A, and z values to logarithms. The following equalities should hold 
if the model is valid for a particular reservoir: 


3V . _ b v" 1 „ 

— = b v C v Z = A 

.*.c = b c 

a v v 


b = b - 1 
a v 


(7) 

( 8 ) 
(9) 


Equations (8) and (9) can be used to test the consistency of the volume 
and area curves. The slopes of volume and area vs. total depth on a 
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log/log plot should differ by 1.0. Dividing equation (4) by equation (5) 
yields the fol’owing: 



( 10 ) 


If the basic model holds, the ratio of volume to area (or mean depth) 
should be proportional to the total depth. Thus, the model can be tested 
through regression analysis of the following equation: 



( 11 ) 


A t-test can be applied to the mean and standard error bwithin each 
project to determine whether the estimate is significantly different 
from 1.0. 

88. In testing the power function model, the parameters in equa¬ 
tions (4), (5), and (11) have been estimated for each of 147 reservoirs. 
The statistical distributions of the optimal slope parameters (b , b , 
b z ) are shown in Figure 6 on log-normal probability scales. Median 
values are 2.97, 1.97, and 0.97, respectively. Thus, in the "typical" 
reservoir, volume and area increase approximately as the cube and square 
of total depth, respectively. This corresponds to the solid geometry 

of an inverted cone or pyramid. About 10% of the projects have b v 
values exceeding 4 and 4% have b y values less than 2. The parameters 

b and b contain roughly the same information as the statistically- 

^ 28 

defined "lake form" proposed by Hakanson 

89. Based upon t-tests, the b z parameter differs significantly 
(p=.05) from 1.0 in about 35% of the projects. This means that the 
single-term power function model fits about two thirds of the projects. 
In the remaining one third, the slope parameters b v and b a are Inconsis¬ 
tent and/or variable with depth. 

90. Table 39 lists error statistics for this model. Estimates of 
the first (V(V)) and third (A(A)) statistics are derived directly from 
equations (4) and (5), respectively. Areas are estimated from volume 
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1 


1 

1 

Table 39 

Evaluation of the Power Function Model 



Gross* 

Median** 

Statistic 

Standard Error 

Standard 


V(V) 

.097 

.062 

A(V) 

.115 

.081 

A(A) 

.097 

.062 


AV (A) 


177 


081 








curves according to equation (7). Changes in volume within various 
strata are estimated from area curves according to: 


AV (A) 



A 2 2 2- A 1 Z 1 

b a +1 


( 12 ) 


(13) 


Standard errors of D range from .097 (V(V)) to .177 (AV(A)). The higher 
standard error of AV(A) could be related to the fact that it is an incre¬ 
mental value which is more subject to measurement error on a percentage 
basis than is total volume or area. A D standard error of .097 corres¬ 
ponds roughly to a standard error of 10%, or to 95% confidence limits 
of ± 20% in an estimated area or volume. The differences between the 
gross standard error and the median, within-project standard error 
reflects the skewness of the error distribution across projects, i.e., 
the model seems to fit some projects considerably better than others, 
as indicated by the b z distribution. 

91. In order to improve upon the above model, variations in the 
slope parameters b^ and b & with depth must be accounted for. The sim¬ 
plest way of doing this is to include higher-order terms in the regres¬ 
sion equations: 


m 

V* = £ c.Z* 1 (14) 

i=0 1 

m 

A* = £ d.Z*i (15) 

i=0 1 


where 

c^, d^ = empirical coefficients (i=0, m) 

* = superscript denoting log^ Q transformation 
m = maximum degree of polynomial 


The error distributions of these functions have been evaluated for 
maximum degrees ranging from 1 to 5. For m=l, the scheme is equivalent 



to the single-term power function model discussed above. For comparative 
purposes, linear polynomial functions have also been tested: 

m 

V = E c- zl (16) 

i=0 1 

m 

A - £ d.Z 1 (17) 

i=0 1 

For each reservoir, the maximum degree of the polynomial has been limited 
to the minimum of m and the number of elevations in the table minus two. 

92. The error statistics in Table 40 indicate that logarithmic 
polynomials are preferable to linear ones according to most criteria. 

Log transformation tends to linearize the relationships and renders them 
easier to fit with low-order polynomial terms. Some reduction in error 
is achieved by including quadratic and cubic terms in depth. Addition 
of fourth- and fifth-order terms, however, tends to increase estimation 
error, presumably because higher-order polynomials can have local minima 
and/or maxima between the fitted points. These are artifacts of the 
fitting process which can cause large estimation errors between the 
fitted points. Cubic polynomials apparently have about the best combina¬ 
tion of flexibility and smoothness for these purposes and data densities. 

Interpolation Schemes 

93. Interpolation methods can be used as alternatives to the 
curve-fitting techniques described above. Interpolation essentially 
involves fitting different curves to different sections of each volume/ 

area/elevation table. A variety of interpolation methods have been 

26 

tested on the same data set used above . These involve different trans¬ 
formations of the volume and area points, including inverse power func¬ 
tions (first through fifth order) and logarithmic. Of the methods 

tested, one involving logarithmic transformations of volume, area, and 

2 6 

total depth points has been shown to have the lowest error statistics 
The errors, however, are essentially equivalent to the error 
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Table 40 


Evaluation of Polynomial Functions 


Trans fo rmation 

Maximum 

Degree 

Gross Standard Error 

V(V) A(V) A(A) 

AV(A) 

logarithmic 

1 

.097 

.115 

.101 

.182 


2 

.069 

.083 

.081 

.166 


3 

.062 

.067 

.081 

.164 


4 

.216 

.292 

.129 

.173 


5 

1.198 

1.280 

.534 

.491 

linear 

1 

.371 

.615 

.200 

.281 


2 

.249 

.170 

.134 

.212 


3 

.134 

.088 

.106 

.196 


4 

.150 

.085 

• 111 

.193 


5 

.221 

.136 

.147 

.265 
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characteristics of the best curve-fitting scheme (log/log cubic poly¬ 
nomials) . Results suggest that either of these methods probably 
approaches the limits of data accuracy. 

Conclusions 


94. In one sense, numerical interpolation methods are preferable 
to fitted curves because the former are more flexible and do not rely 
strongly upon particular forms or curve shapes. Interpolation requires 
storage of and access to the entire table, however, as opposed to a few 
parameters in the case of a fitted function. Fitted curves also provide 
some smoothing of the entries in the table which may serve to filter 
errors. 

95. Based upon relative errors, storage requirements, and compu¬ 
tational considerations, log/log cubic polynomials seem to be the best 
alternative for summarizing hypsographic curves, given data of the 
type examined here. The approximate equivalence of the V(V) and A(V) 
error statistics indicates that storage of the parameters of the volume 
curve alone would be adequate as a basis for estimating both area and 
volume. Algebraic differentiation of the volume polynomial can be used 
to estimate area at any given depth. Thus, both the area and volume 
curves can be summarized by a total of four polynomial parameters 
according to the following scheme: 


Z* = log. (E - E ) (18) 

10 o 

V* = log 1Q (V) (19) 

A* = log^ Q (A) (20) 

V* = c + c,Z*+ c_Z* 2 +c,Z* 3 (21) 

o 1 2 3 

A = (c + 2c Z* + 3c Z* 2 )V/(E - E ) (22) 

12 3 o 
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For cases in which data from only a few elevations are available and/or 
the quadratic or cubic terms do not add appreciably to accuracy (as 
assessed via stepwise polynomial regression), first- or second-order 
polynomials may be used. 

96. Use of the method requires knowledge of the base elevation, 

E q . If not available, it can be estimated approximately by extrapolating 

a plot of vs. E to the horizontal axis. Any error in E estimated 

o 

in this way would be offset in subsequent estimation of the polynomial 
coefficients. 

97. The same functions can be used in very data-limited situations 
in which only estimates of mean depth, maximum depth, and surface area 
are available. Approximate parameter estimates in this case are given 
by: 



c 

o 


c 3 = 0 


z /z 

max' mean 


log (AZ ) - c log (Z ) 
10 mean X 10 max 


(23) 

(24) 

(25) 


98. To insure that the fitted polynomial is not used outside of 
the range of data availability, maximum and minimum elevations should 
also be stored with the fitted coefficients. It will be necessary to 
test the fitted coefficients to insure that the areas estimated through 
differentiation (equation (22)) are positive and increasing with eleva¬ 
tion throughout the application range. 
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SLOPE 


Figure 6 

Distributions of Volume and Area Slope Parameters 
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PART XIII: VARIABILITY OF TROPHIC STATE INDICATORS IN RESERVOIRS 


Introduction 

99. The development and testing of empirical eutrophication models 
for reservoirs requires averaging of water quality measurements over 
spatial and temporal scales. If within-pool water quality variations 
are not random with respect to date, station, or depth, then summary 
statistics for a given reservoir will depend to some extent upon the 
particular data-reduction method employed. The choice of reduction 
method may, in turn, influence conclusions regarding the adequacy of 
existing models as well as the parameter estimates of any new models 
which may be developed. 

100. There is no standard data reduction procedure which can be 

used prior to model development, testing, or application. Methods have 

included, for example: (1) taking the median or mean of all within-pool 
23 

observations ; (2) sequential averaging over depths, stations, and 

29 30 

dates : (3) seasonal averaging within specific depth ranges ; and 

(4) various weighted-averaging schemes which reflect morphometric char¬ 
acteristics. As compared with natural lakes, many reservoirs pose 
special data reduction problems due to extreme spatial and/or temporal 
variations in conditions. 

101. In this section, a subset of the current CE water quality 
data base is analyzed in order to describe spatial and temporal varia¬ 
tions in trophic state indicators within a group of reservoirs. The 
analysis covers spatial relationships, seasonal relationships, variance 
components, and error estimation. Implications of the results are dis¬ 
cussed with respect to the design of monitoring programs and use of the 
data in model development and evaluation. Details on many of the 

31 32 

procedures used and results discussed below can be found elsewhere ' 
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Data Base 


9 

102. EPA National Eutrophication Survey (NES) data from 484 
stations located within 108 CE projects have been used as a basis for 
this analysis. The relatively uniform sampling program designs used by 
the NES provide data which are suitable for statistical treatment. One 
drawback, however, is that under this program, reservoirs were typically 
sampled only three times during one growing season. In Phase II of this 
project, there are plans to examine data from other agencies, which, in 
many cases, are more intensive and/or cover longer periods. 

103. Surface total phosphorus, Secchi depth, and chlorophyll-a 
values have been expressed in terms of Carlson's Trophic State Indices 
(1^, I T and 1^, respectively)^: 

I p = 4.2 + 33.2 log 1Q P , (26) 

I = 60 - 33.2 log Z (27) 

T ^10 s 

I = 30.6 + 22.6 log B (28) 

D IV J 

where 

P = total phosphorus concentration (mg/m^) 

Z = Secchi depth (m) 

s 3 

B = chlorophyll-a concentration (mg/m ) 

The indices are calibrated such that the three versions are equivalent, 
on the average, when applied to midsummer, epilimnetic data from northern, 
natural lakes. Expression of measurements on the above scales tends to 
reduce the skewness in the distributions of the variables and provides 
benchmarks for assessing reservoir trophic state relationships in com¬ 
parison to those typical of natural lakes. A statistical data summary is 
given in Table 41. 
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Table 41 

Statistical Summary of EPA/NES Trophic Index 
Data from 108 CE Projects 

standard 


Variable 

n 

mean 

deviation 

minimum 

maximum 

X P 

1421 

55.8 

13.1 

24.1 

99.0 


1493 

58.9 

12.7 

25.4 

112.9 

In 

1505 

48.2 

10.3 

o 

• 

00 

81.6 


1 










Case Studies of Spatial Relationships 

104. Spatial variations typical of a few reservoirs are depicted 
in Figures 7-10. Mainstem stations are displayed in downstream order 
within each reservoir (not to scale) and mean values are plotted for 
each version of the Trophic State Index. These plots provide initial 
perspectives on the types of spatial trends and relationships which can 
be found in reservoirs and are important supplements to the more formal 
statistical treatment of the data presented in subsequent sections. The 
plots seem to illustrate a number of important controlling processes, 
which are discussed below in relation to each reservoir. 

105. Figure 7 contains data from the White River system on the 
Arkansas/Missouri border. Four reservoirs are connected in series: 
Beaver, Table Rock, Taneycomo, and Bull Shoals. With the exception of 
Taneycomo, they are all deep reservoirs with hydraulic residence times 
in excess of 250 days. Trophic State Index behavior in the First reser¬ 
voir, Beaver, is considerably different from that typical of the down¬ 
stream impoundments. Concurrent reductions in I and I T most likely 
reflect sedimentation and the three index curves do not converge until 
the dam. Once most of the sediment load has been removed in Beaver, 
agreement among the index curves is good at most downstream stations. 
Increases in Table Rock probably reflect the influence of a major point 
source which accounts for more than 70% of the total phosphorus loading 
to that reservoir. The relatively low values of I in Taneycomo can be 
explained by the direct influence of subsurface discharge from upstream 
Table Rock Dam. Taneycomo has a short hydraulic residence time (7 days) 
and surface water temperatures at the times when summer chlorophyll-a 
samples were taken roughly matched temperatures at the 100-foot level 
just above Table Rock Dam (~15°C). Taneycomo's short hydraulic residence 
time is apparently insufficient to permit recovery of temperatures and 
algal populations from those typical of the Table Rock hypolimnion. 
Decreases in all versions of the index are evident moving downstream in 
Bull Shoals, and relatively stable conditions are reached over the last 
four stations. 
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TROPHIC STATE INDEX 


Figure 7 White River System 

WHITE RIVER SYSTEM 
BEAVER TABLE ROCK T.C. BULL SHOALS 
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Figure 8 Sakakawea 

SAKAKAWEA (t=584 days) 





















106. Lake Sakakawea (Figure 8), on the Missouri River in Montana, 
shows TSI variations over 40 units, or sixteenfold differences in trans¬ 
parency and total phosphorus. Upstream portions were classified by the 
EPA/NES as "hyper-eutrophic" and stations near the dam as "oligotrophic". 
Below the second station, the chlorophyll-a index follows but remains 
roughly 5-10 index units below the phosphorus and transparency indices. 

107. Old Hickory (Figure 9), on the Cumberland River in Tennessee, 
shows relatively small downstream reductions in phosphorus and trans¬ 
parency and a steady increase in chlorophyll-a. With a mean residence 
time of 11 days and sediment removal by upstream reservoirs on the 
Cumberland River, longitudinal variations in I and 1^ are not as evident 
as in above panels. The gradual increase in I_ might be a hydraulic 
residence time effect, i.e., the time scale required for algal popula¬ 
tions to "equilibrate" with available nutrient and light levels may be 
appreciable in relation to time-of-travel within the pool. 

108. Similar behavior is evident in Lake Barkley (Figure 10), 
which is located further downstream on the Cumberland River and has a 
residence time of 16 days. One important difference is that the phos¬ 
phorus index remains consistently higher than the other versions. Both 
ambient available nutrient concentrations and bioassays indicate, how¬ 
ever, that algal populations in Lake Barkely were nitrogen-limited at 
the times of sampling. Use of I p alone as a measure of nutrient avail¬ 
ability may not be appropriate in this case. 

109. The above case studies illustrate a number of factors which 
can be important in assessing reservoir trophic state relationships: 
sedimentation, upstream impoundment effects, hydraulic residence time 
effects, and nitrogen limitation. Reservoir hydrodynamics partially 
determine the transformations of these and other factors into spatial 
variations in the trophic state indicators. Upstream/downstream varia¬ 
tions contain information on rates and directions of controlling processes. 
Graphing of spatial relationships and expression/analysis in terms of 
distance (river mile) and/or time (time-of-travel) will aid in quanti¬ 
tative data analysis and interpretation. Use of station means rather 






than reservoir means seems to make more sense for model testing purposes. 


Seasonal Relationships 

110. Average seasonal variations in the index components are de¬ 
picted in Figure 11. Station means have been computed and their effects 
removed from the data prior to calculating the mean and standard error 
for each month (March-November) and index component. Analyses of vari¬ 
ance indicate that monthly effects are significant (p< .0001) for each 
component and strongest in the case of chlorophyll-a. The seasonal 
variations depicted in Figure 11 are characteristic of this collection 
of reservoirs but not necessarily of each reservoir individually. 

111. Average seasonal effects on phosphorus and transparency are 
similar: both tend to be lowest during March and midsummer and highest 
during April and November, possibly reflecting seasonal flow and turbid¬ 
ity variations and influences of turnover periods. Monthly effects on 
chlorophyll-a suggest a spring maximum (April-May), followed by a June 
depression, a midsummer maximum, and lower values in November. Tempera¬ 
ture and light effects may be responsible for the relatively low chloro¬ 
phyll-a levels during March and November. The June depression may be 
due to seasonal succession of algal species. A more detailed examination 
of the data indicates that lower June chlorophyll-a levels are character¬ 
istic of about half of the stations sampled in June, while the rest have 
June levels more typical of May or July values. In testing seasonal 
aspects of TSI behavior, Carlson 30 also noted a June depression in chloro¬ 
phyll-a index relative to the phosphorus index in three natural lakes. 

112. Differences among various versions of the index provide a 
measure of "lake-like" behavior, since the index system is calibrated so 
that I Q , l T , and I p values are equivalent, on the average, when applied 
to midsummer, epilimnetic data from northern, natural lakes. Figure 11 
indicates that the range of index means is generally lowest during mid¬ 
summer (approaching 5 during August) and highest during March, June, and 
November (approaching 15). Minor recalibration of the phosphorus and/or 


MONTHLY MEAN INDEX 


Figure J 


Monthly Variations 


in 


liup 



* mean ± 2 standard errors 
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transparency index would bring I and I T into agreement for all seasons, 
since the monthly effect curves in Figure 12 are roughly parallel. Since 
seasonal chlorophyll-a behavior is fundamentally different, however, 
recalibration alone would not eliminate biases (i.e., significant dif¬ 
ferences between I and I or between I and I_) for all seasons. 

BP B I 

113. Thus, seasonal factors must be considered in reducing the 
data and in recalibrating/redesigning the index system for use in reser¬ 
voirs. One approach would be to restrict averaging period to midsummer. 
An alternative would be to include explicit seasonal effect terms in the 
model or index system. These approaches will be investigated in future 
model development work. 


Variance Component Estimation 

114. Two-way Analyses of Variance have been applied to each reser¬ 
voir and index component to test for the significance of variations 
associated with station and sampling date. Results are summarized in 
Table 42. Significant (p <• .1) differences among station means in I p , 

I_, and I were found in 46, 37, and 52 out of 105 projects, respectively. 
Significant differences among date means were found in 62, 67, and 64 
projects, respectively. 

115. The following ANOVA model has been employed to derive pooled 
estimates of the variance components of each version of the index: 


r hi j 


y + r, + s, . + t + e 

h hi h] hi] 


(29) 


where 


y, .. = index measurement in reservoir h at station i on date j 
hi] 

u = grand mean 

r^ = average effect of reservoir h on grand mean 

s, . = effect of station i in reservoir h 
hi 

t, . = effect of date j in reservoir h 
h] 

e, .. = random effect 
hi] 
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Table 42 

Summary of ANOVA Results 

Station Effects 

Date Effects Not Significant Significant** 

Phosphorus Index - 

* 

Not Significant 32 11 

Significant 27 35 

Chlorophyll-a Index - 

Not Significant 33 5 

Significant 35 32 

Transparency Index - 

Not Significant 27 14 


Significant 


26 
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The relative magnitudes of the within- and between-reservoir components 

are of special significance to monitoring and modelling efforts. With 

relatively large within-reservoir components, it would be difficult to 

obtain much accuracy in reservoir summary statistics (e.g., reservoir 

mean) with limited data. This would reduce the explainable variance 
2 

(R ) of any model calibrated to the reduced data set, make it more dif¬ 
ficult to distinguish among alternative model formulations, and increase 

the error associated with model parameter estimates. 

2 

116. SAS procedures have been used to estimate the above variance 
components for each index separately and to estimate analogous covariance 
components for each pair of indices (Ig/I T , Ig/I , I^/Ip). Results are 
shown in Figure 12. The phosphorus and transparency index components 
exhibit similar patterns: between-reservoir differences account for 
60-66% of the total index variance, as compared with 29% in the case of 
chlorophyll-a index. Between-reservoir variances indicate that differ¬ 
ences in chlorophyll-a are considerably less marked than would be 
predicted based upon differences in the phosphorus or transparency indices. 
Conversely, there is greater temporal and random variance in chlorophyll-a 
than in phosphorus or in transparency. 

117. The covariance components on the right-hand side of Figure 12 
provide some insights into relationships among the indices at different 
averaging levels. Tne between-reservoir and between-station covariance 
components are positive in all cases. Thus, the various versions of the 
index correlate positively both among reservoirs and among stations 
within reservoirs. Temporal components indicate a positive covariance 
for phosphorus/transparency but slightly negative covariances for the 
pairs involving chlorophyll-a. Thus, when temporal variations a; a 
given station are analyzed, one would expect, on the average, to find 

a positive correlation only between the phosphorus and transparency 
indices. This correlation might be attributed, for exaitple, to turbidity 
variations following seasonal or short-term (storm-event) flow variations. 
Despite its positive covariance between reservoirs and between stations, 
chlorophyll-a does not covary temporally with the other indices. 


.At 
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118. The EPA/NES data base includes measurements from one growing 
season within any reservoir and does not permit testing for between-year 
variance or covariance components. Thus, it is not possible with this 
data set to test for year-to-year covariance between chlorophyll-a and 
phosphorus or transparency. Distinguishing between seasonal and year- 
to-year variance components will be possible with an expanded data base 
including data from other agencies and monitoring programs. 


Error Analysis 


119. The above analyses demonstrate that within-reservoir varia¬ 
tions cannot be considered random with respect to dates or stations, the 
primary parameters used in monitoring program design. This has impli¬ 
cations for estimating the accuracy of reservoir or station mean values 
calculated from the data set. If variations were random and serially 
independent with respect to station and date, the following statistic 
would be appropriate for estimating the variance of a reservoir mean: 


Var(y) = 


Var (y) 


where 

N = total number of observations within a reservoir for a 
given index 

33 

Using two-stage sampling theory , the following formula is more appro¬ 


priate : 


2 2 2 

a i . a 

Var (y) = — + — + ~ 
n n L N 

s t 


where 

2 

o = mean squared deviation 

n = number of stations sampled 
s 

n = number of dates sampled 

The first term accounts for the influence of between-station varia¬ 
tions, the second for between-date variations, and the third for random 




variations. Note that equation (31) reduces to equation (30) when spatial 

2 2 2 

and temporal variations are insignificant (o s = o t =0 and Var(y)=a^). 
Because both spatial and temporal variations often exhibit patterns or 
trends (see Figures 7-11), they cannot be considered serially independent. 
Thus, equation (31) provides error variance estimates which are approximate 
but useful for assessing the relative contributions of spatial and tem¬ 
poral variance to the variance of reservoir means. 

120. Based upon the total number of observations, stations, and 
reservoirs in the NES data set, the "typical" reservoir survey program 
covers 5 stations (n g ) on 3 dates (n^.) , for a total of 15 observations 
(N). Using these values and the pooled variance cc.nponents depicted in 
Figure 5, equations (30) and (31) have been applied to each version of the 
index and results are displayed in Figure 13. 

121. Considering the effects of spatial and temporal variance 
components (equation (31)) increases mean error by about a factor of 
three over estimates derived from equation (30). Most of the error vari¬ 
ance is due to the temporal component, especially in the case of the 
chlorophyll-a index. The error variances indicate that the EPA/NES 
sampling strategy has provided estimates of reservoir geometric means 
which are typically accurate (p< .05) to within factors of 1.6, 2.2, and 
1.7 for surface phosphorus concentration, chlorophyll-a concentration, 
and Secchi depth, respectively. 

Monitoring Implications 

122. Error analyses can be used to improve upon monitoring program 
designs. For example, given the objective of collecting data to be used 
in estimating a reservoir mean witli minimum variance, the above results 
suggest that an increase in sampling dates would be more effective than 
an increase in sampling stations, because the date effect term dominates 
the error equation. Since the variance component estimates have been 
pooled, these results apply to this collection of reservoirs as a whole 
and not necessarily to each reservoir. The same approach could be 
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applied using parameters estimated for each reservoir individually (n , 

2 2 2 S 
n , N, 0 , a , a ). "Optimal" designs could be formulated based upon 
t s t e 

the error analysis framework and upon functions which relate n g , n t , and 
N to monitoring cost. Given variance components estimated from prior 
monitoring data, improvements in program design (changes in n g , n fc , and 
N) for a given reservoir could be formulated to yield minimum error for 
a fixed total cost or minimum cost for a fixed total error. The approach 
could be expanded to include depth as a third sampling dimension. This 
represents a logical application for the error analysis framework in a 
monitoring context. 



123. In evaluating models, differences between observations and 
predictions can be attributed to three types of error: parameter error, 
data error, and model error. The first reflects uncertainty in the model 
coefficients, the second, errors in the predicted and/or predictor vari¬ 
ables, and the third, influences of factors which are not considered in 
the model structure. Analyses of the type conducted above can be used 

to quantify potential data errors and separate them from the other com¬ 
ponents. This provides insights into the adequacy of a data base for use 
in model testing. If, for example, the data error component dominates, 
it would be difficult to distinguish among alternative models or to 
improve upon them without first improving the data base. 

124. Table 43 summarizes results of regression analyses which 
have been done to summarize relationships among the three versions of 
the index using station mean and reservoir mean values. Results further 
demonstrate the need for recalibration of Carlson's index system in 
applications to reservoirs. The sensitivity of the chlorophyll-a com¬ 
ponent to phosphorus or transparency ranges from .30 to .37 and is in 
all cases significantly different from 1.0, the value obtained when the 
index system is calibrated to natural lakes. In contrast, the sensi¬ 
tivity of I to I is .88 for stations means and .91 for reservoir means. 






! 
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Table 43 

Regression and Corresponding Error Analyses 


*** Total** *** Sampling* Sampling Error 

Model r s, Error Error Total Error 

_ b 


Station 

i Means (n=484)- 






r B = 

28.7 + .35 I p 

.55 

.024 

42.3 

23.1 

.55 


30.0 + .31 I 

T 

.46 

.027 

47.6 

22.7 

.48 

Z P = 

4.0 + .88 I 

T 

.83 

.027 

47.6 

25.9 

.54 














X B = 

28.0 + .37 I p 

.59 

.048 

32.5 

16.3 

.50 

X B = 

31.0 + .30 I m 

T 

.44 

.059 

41.0 

16.0 

.39 

X P = 

2.0 + .91 T 

T 

.83 

.059 

41.0 

22.1 

.54 


* Sampling Error due to error in estimating mean index values within 
each station or reservoir; sampling errors for station mean phosphorus, 
chlorophyll-a , and transparency indices are estimated at 14.3, 21.3, 
and 15.0, respectively; corresponding sampling errors for reservoir- 
means are 11.2, 14.8, and 13.1. 


** Total Error = mean squared residual 

*** s. =» standard error of regression slope 
o 


i 









While the correlation coefficients (and explained variance) are con¬ 
siderably lower for predicting chlorophyll-a (.44-.59) than for predic¬ 
ting phosphorus from transparency (.83), the mean squared prediction 
errors are similar. 

i25. Part of the mean squared error for each regression model can 
be attributed to sailing error in estimating the reservoir and station 
mean values. The sampling error component of the mean squared error for 
each model is estimated from: 

Var(y-bx) - Var(y) + b^Var(x) (32) 


where, 

y = predicted index 
x = predictor index 
b = slope of regression model 

Results of error analyses have been used to estimate sampling errors in 
station mean and reservoir mean values. Results indicate that roughly 
half (39-54%) of the residual standard error can be attributed to sampling 
error in the data values. The remaining variance presumably reflects 
model error, or the effects of factors (e.g., season, sediment) which 
are not accounted for in the model. The choice of averaging method 
(stations vs. reservoirs) does not influence the model coefficient values 
significantly. Mean squared errors are reduced when reservoir mean 
values are used, partially resulting from a reduction in the sampling 
error component. The standard errors of the regression slopes, however, 
increase roughly twofold when reservoir mean values are used in place 
of station means. Thus, using station means permits better definition 
of model coefficients. 

126. The intent of the regression analyses presented in Table 43 
is to demonstrate the error analysis approach and identify influences 
of data reduction method. The models suggest that chlorophyll/phosphorus 
and chlorophyll/transparency relationships in these reservoirs are sig¬ 
nificantly different from those which are typical of natural lakes. 
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The models are inadequate for predictive use, however, because a number 
of important factors have not been considered, including season, nitrogen 
limitation, hydraulic residence time, region, and external sediment 
loading. These and other factors will be considered in developing a 
Trophic State Index system applicable to reservoirs under Phase II of 
this project. 


Conclusions 


127. Statistical models of index (or water quality) variations 
within reservoirs have been shown to be complicated by the effects of 
spatial and temporal variations and by non-randomness or serial depen¬ 
dence in these variations. The above analyses have demonstrated how 
some of these influences can be approximately treated and applied to 
assess data adequacy for computing reservoir means and to improve upon 
monitoring program designs. A more thorough statistical treatment 
would require more intensive data sets and involve the construction of 
more complex statistical models applied separately to each reservoir 
using simulation techniques. This level of detail is not justified or 
feasible within the context of this study. 

128. The following general conclusions can be drawn from the 
analyses of EPA/NES data in previous sections: 

a. Spatial relationships among trophic state indicators are 
important in some reservoirs, especially when stations 
are viewed in downstream order. 

b. Analyses of Variance indicate that within-reservoir vari¬ 
ations are often non-random with respect to stations 
and/or sampling dates within a given year. Variations 
with respect to date are typically stronger than varia¬ 
tions between stations, particularly in the case of 
chlorophyll-a'. 

c. Some of the temporal variance within reservoirs and 
stations is fixed with respect to month or season. 
Seasonal effects on phosphorus are qualitatively similar 
to seasonal effects on transparency but differ from 
seasonal effects on chlorophyll-a. 
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d. Between-reservoir variance in Carlson's chlorophyll-a 
index is roughly one third of the between-reservoir 
variances in the phosphorus and transparency indices. 

< 2 . For this data set, chlorophyll-a variance between 

sampling dates within reservoirs is greater than its 
variance between reservoir means. The reverse is true 
for phosphorus and transparency. 

_f. Covariance components indicate significant positive cor¬ 
relations among the three versions of the index system 
when variations between reservoirs and between stations 
within reservoirs are considered. The chlorophyll-a 
index does not correlate with either of the other indices, 
however, when temporal variations (at a given station or 
within a given reservoir) are considered. 

Because of non-randomness with respect to stations and 
dates, the error variance of reservoir mean values for 
each index are typically three times those estimated from 
the familiar formula for mean variance (a 2 /n). 

h. Given the objective of estimating reservoir means with 
minimum variance, error analyses indicate that increases 
in the number of sampling dates would be generally more 
effective than increases in the number of stations per 
reservoir for improvement in the EPA/NES sampling design. 

i^. Regression analyses indicate that chlorophyll-a levels 
are significantly less sensitive to phosphorus or trans¬ 
parency, as compared with the natural lakes used in 
developing Carlson's index system. Future development 
of an index system for reservoirs should consider the 
effects of season, region, nitrogen limitation, residence 
time, and sediment loading. 

j_. Use of station means, as opposed to reservoir means, in 
recalibrating the index system causes small increases in 
standard error of estimate but substantially reduces the 
standard errors of model parameters. 


Implications for Data Reduction Strategies 


129. The conclusions in the previous section suggest that the fol¬ 
lowing data-reduction/analytical strategies be used in testing and 
developing models under Phase II of this project: 

a. Since spatial variations and trends have been shown to be 
often statistically identifiable and useful for interpreta¬ 
tion purposes, use of station means would seem to be more 
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desirable than use of reservoir means for model testing. 

Use of station means would also permit better definition 
of model parameters. 

b. It would be useful to develop a scheme for spatial orien¬ 
tation of stations within each reservoir with respect to 
major tributary (arm) and distance (river mile). Some 
aggregation of stations based upon proximity may be 
feasible within this framework. 

c. Seasonal factors should be considered in averaging data 
within each station or station group. This would involve, 
for example, estimation of "spring”, "summer", "fall", as 
well as "growing season" and "annual" averages. 

d. For non-NES stations which are sampled during more than 
one year, tests for significant year-to-year differences 
should be made and used to decide whether aggregation of 
data from different years is justifiable. 

e_. While the above analyses have been based exclusively upon 
surface sampled for phosphorus, averaging with depth 
should be considered, at least within the photic zone, 
for testing of relationships among phosphorus, chloro- 
phyll-a, and transparency. 

£_. To permit testing of loading response models, seasonal 

estimates of reservoir means could be derived by averaging 
across stations. Due to longitudinal variations in many 
reservoirs, however, near-dam or discharge conditions 
should also be used as bases for loading model evaluations. 
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PART XIV: EVALUATION OF METHODS FOR ESTIMATING PHOSPHORUS LOADINGS 


Introduction 

130. The estimation of reservoir nutrient budgets entails estima 
tion of the total mass of nutrients passing given sampling points over 
a given period of time, typically at least one year. While continuous, 
flow-weighted composite sampling for concentrations would provide the 
best basis for deriving such estimates, usually only periodic grab- 
samples are available for stream concentrations. These concentrations 
must be integrated with flow records (typically continuous) in order to 
estimate the desired loadings. The purpose of this section is to test 
and compare alternative methods in order to provide some guidance for 
future nutrient budget calculations on CE reservoirs. 


Preliminary Analysis 


131. The relationship between concentraton and flow influences 

the appropriateness of a given loading calculation method at a given 

station. A subset of flow and phosphorus concentration data from the 

current CE data base has been analyzed in order to provide some perspec 

34 

tive on this relationship . The subset includes 86 tributary and 33 
discharge stations, each with at least 25 total phosphorus/streamflow 
pairs obtained from STORET. Table 44 describes the symbols and funda¬ 
mental equations used to characterize flow and concentration variations 

132. Results of the preliminary analysis are given in Table 45, 

with data grouped by station type. The regression model relating the 

logarithm of concentration to the logarithm of flow explains, on the 

average, 12.3% of the variance in concentration at tributary stations 

and 6.6% of the variance in concentration at discharge stations. 

2 

Figure 14 shows that the upper percentiles of the R distribution at 
tributary sites are roughly twice those found at discharge sides. 
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Table 44 


Fundamental Equations and Symbols 

Loading Definition : L = Q C 

W « X + Y 

Flow/Concentration Model : C = a Q* 3 

Y - log^a + b X 

Distributions: Y:(mean= u , std.dev.= a ) 

- y y 

X:(mean= u , std.dev.= a ) 
x x 

Symbols : L » Loading (mass/time) 

Q = Flow (volume/time) 

C * Concentration (mass/volume) 
W = log 10 (L) 

X » log 10 (Q) 

V - log 10 (C) 

a,b = Regression model parameters 
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Table 45 

Preliminary Analysis of Flow/Total P Concentration Relationships 


Station Type 


Statistic 


Tributary 


Discharge 


k ; 

k ' 

number of stations 

86 


33 


h 

fraction of variance explained 
by regression model 

.123 

* .151* 

. 066 

+ 

.078 

V 

residual variance 

.101 

t .075 

.090 

+ 

.050 


serial correlation of residuals 

.228 

t .205 

.300 

+ 

.276 

i. 

conc/flow sensitivity (b) 

.124 

+ .250 

.097 

+ 

.138 

. 

« 

standard dev. of log (flow) 

.573 

t .246 

.600 

+ 

.303 

(• 

standard dev. of log 1Q (conc) 

.323 

t .133 

.297 

+ 

.088 


* mean ± one standard deviation 









Figure 14 



PERCENTILE 
Figure 15 

Distributions of Regression Slopes at Tributary and Discharge Stations 









133. Both the mean and the standard deviation of the regression 
slope (b) are larger in the case of tributary stations. The distribu¬ 
tions of b are compared on a normal probability scale in Figure 15. 

While the medians are nearly equal, the upper percentiles are higher 
and the lower percentiles are lower in the case of tributary stations. 

The wide distribution of b values across stations suggests that loading 
calculation methods must be capable of accounting for alternative flow/ 
concentration relationships. 

134. These results indicate that concentration tends to be more 
flow-dependent at upstream tributary stations than at discharge stations. 
Reservoir pools may buffer variations associated with runoff events 
(which would tend to produce high b values) or point source discharges 
(which would tend to produce low b values). The fact that streamflow 

is highly regulated at reservoir discharge points may also contribute 
to weaker flow/concentration relationships. The serial correlation of 
residuals (i.e., concentration, after the effect of flow is removed) 
tends to be higher in the case of discharge stations (.30 vs. .23 on 
the average). This suggests that seasonal factors or long-term trends 
may have greater influences at discharge points. 

135. Table 46 lists means regression slopes by station type and 
component (dissolved P, total P-dissolved P, and total P, respectively). 
This breakdown is based upon measurements of total P and dissolved P at 
52 tributary and 17 discharge stations. For both station types, the 
mean slopes of dissolved phosphorus with respect to flow are not signi¬ 
ficantly different from zero. In the case of total P-dissolved P, how¬ 
ever, the mean slope is .34 ± .05 at tributary stations and .11 ± .09 

at discharge stations, although the latter is not significantly different 
from zero. The influence of streamflow on the transport of particulate 
phosphorus probably accounts for these results. The dissolved component 
tends to be more independent of flow than the particulate component. 

The relative weakness of the dissolved phosphorus/flow relationship may 
reflect a buffering effect of adsorption chemistry on stream phosphorus 
levels and/or the fact that transport efficiency for dissolved phosphorus 






Table 46 

Concentration/Flow Sensitivities by Component and Station Type 


Component- 


Station 

Type 

Number of 
Stations 

Total P - 
Dissolved P 

Dissolved P 

Total P 

Tributary 

52 

.341 ± .050* 

.019 ± .034 

.239 + .044 

Discharge 

17 

.109 t .087 

.036 1 .101 

.082 t .038 

* mean + one 

standard error of 

mean 
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is not velocity-dependent, as in the case of the particulate fraction. 


Estimation Methods 


136. Four algorithms for estimating loadings have been tested: 

a. average loading. 

b. average concentration x average flow. 

£. flow-weighted average concentration x average flow. 

d. regression estimate based upon log (concentration) vs. 
log(flow) relationship. 

These schemes are described in Table 47. a few other types of regression 
models have also been evaluated, but none proved better than the model 
used in Method 4 (see Table 47). Two approaches to evaluating these 
methods have been taken: one involves subsampling from simulated flow 
and concentration data and the other, subsampling from real flow and con¬ 
centration data. These methods and results are described below. 

Tests Based Upon Simulated Data 

137. Table 48 describes an algorithm used to generate flow and 
concentration time series using Monte-Carlo techniques. This method has 
been used to produce five years of simulated daily-average flow and con¬ 
centration data. Mean loadings have been calculated directly for each 
year and compared with estimates based upon subsampled taken at monthly 
intervals and employing the calculation methods listed in Table 47 

For each year, a total of thirty trials (sets of subsamples) have been 
made, representing regular sampling on day 1 to day 30 of each month, 
respectively. Thus, error statistics are based upon a total of 150 
trials for each method. Results have been found to be insensitive to 
the number of trials. The simulation algorithm is not adequate for 
evaluation of interpolation methods (in which the sequence of samples 
is considered), since no serial correlation or seasonal factors are 
included. The parameters of the simulation model are typical of the 







Table 47 


Estimation Methods 


Method I : Average Loading 

1 C i 


h. " 


Method 2 ; Average Concentration x Average Flow 


Z c, 


Method 3 : Flow-weighted Average Concentration x Average Flow 


t 3 - Q 


C. 


2 *4 


Method 4 aggression Estimate 

Eq i c i fnQ 


b+1 


S *4 


where, 

» flow on senile data i 
« concentration on sample date i 
n ■ number of sample dates 
b « slope of log(c) vs. log(q) regression 

Q - average flow over entire period, based upon continuous flow record 
£ * sub over all sample dates (i"l to n) 



Table 48 


Algorithm for Generation of Flow/Concentration Time Series 


0 . 

0 . 

.52 
.26 

(-.8 -*• . 8 ) 

Algorithm ; 

x i - U x + N(0,l)d x 

Y. = vi + N(0,l)d + (X, - y )b 

i y y i x 


Input Values ; 


= mean of logarithm of flow = 

p = mean of logarithm of concentration = 

d x = standard deviation of logarithm of flow = 

d = standard deviation of logarithm of cone = 

Y (random component) 


b = slope of flow/concentration model 


Symbol ; 

N(0,1) = normal random deviate with mean zero and standard 
deviation one 









data distributions depicted in Figure 15 and Table 45. A range of 
flow/concentration sensitivities (b in Table 47) have been used (-.8 to 
+ . 8 ) . 

138. The following error statistics have been used to compare 
"observed" and estimated loadings for method and trial: 


- 5 t (v * 


(33) 


(34) 


where, 

Bias. = bias associated with method j 
3 

MSEj = mean squared error associated with method j 

M = total number of trials = 150 

= actual loading for trial k (mass/time) 

= estimated loading for method j and trial k (mass/time) 

For each method, these error statistics are plotted vs. the flow/concen¬ 
tration sensitivity statistic in Figures 16 and 17, respectively. 

139. Only Method 1 is unbiased for all values of b. The MSE of 
this method, however, is significantly higher than that of the other 
methods at values of b greater than -.2. This method is best under 
conditions where loading is relatively independent of flow (b+-l). 

This might be the case, for instance, at a station which is located 
below a major point source of nutrients, but not where storm-event or 
non-point loads are significant. 

140. The performance of Method 2, in which concentration and flow 
are averaged independently, is a strong function of b, both with respect 
to bias and variance. This method is unbiased only at b=0, but has 
biases of +35% and -23% at b values of -.2 and +.2, respectively. Trends 
in bias continue moving toward more negative or more positive b values. 
The method has a sharp minimum in MSE at b=0. This scheme is apparently 
best only when concentration and flow are truly independent, but gives 
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substantial biases and MSE's for even weak tiow/concentration relation¬ 
ships • 

141. Method 3, which employs the flow-weighted average concentra¬ 
tion, behaves qualitatively similar to Method 2, but is not as sensitive 
to b values. Zero bias and minimum variance are evident at b=0. This 
method is analogous to a "ratio estimate" used in classical sampling 
theory 

142. Method 4 has less bias and variance than Method 3 for most 

b values. Unlike the other methods, the regression model can adjust to 

different types of flow/concentration relationships and estimation errors 

are less sensitive to b values. Bias becomes more significant at high 

values of b. At b=.6, for example, Method 4 underpredicts loading by an 

average of 15%. The MSE at this b value is .13, which corresponds to a 

standard error of + 36%. Thus, bias is less than one half of one stan- 

2 

dard error and accounts for 17% of the MSE (.15 /.13) at b=.6. Additional 
tests indicate that applying the regression model separately to each 
daily flow in the year, a more tedious calculation, does not reduce the 
bias or variance of Method 4 at any b value. 

Tests Based Upon Real Data 

143. Calculation methods have also been tested using the flow and 
concentration data from the 119 stations analyzed above. Subsamples of 
12 flow/concentration pairs have been taken at regular intervals from 
each station. Loadings estimated for each subsample and method have 
been compared with loadings estimated by applying Method 1 to all flow/ 
concentration pairs available for each station. A total of 508 subsamples 
have been taken from 86 tributary stations and 190 subsamples from 33 
discharge stations. Bias and MSE estimates are given in Table 49 for 
each method and station type. These results are approximate because of 
errors involved in estimating actual loading. 

144. Generally, Methods 3 and 4 appear to perform better than 
Methods 1 or 2 for both station types. The MSE of Method 3 equals that 
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Table 49 

Results of Method Testing Using Real Flow and Concentration Data 

Estimation Method* ** 

Statistic Method 1 Method 2 Method 3 Method 4 

* 

-- -Tributary Stations (N=86,M=508) 


Bias 

-.002 

-.027 

-.029 

-.023 

MSE 

.530 

.297 

.176 

.176 



/ \T— 7 7 M -1 q n > - 








Bias 

.001 

.068 

-.017 

-.000 

MSE 

.272 

.585 

.108 

.156 


* N - number of stations, M » number of trials 

** see Table 47 
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of Method 4 in tributary stations (.176), but is lower in discharge 
stations (.108 vs. .156). When averaged over the range of b values in 
the data set, none of the methods is appreciably biased. 

Prior Error Estimation 


145. To provide bases error analysis and assessment of data 
adequacy, a means for estimating the variance of loading estimates de¬ 
rived from a given set of flow/concentration data is needed. Table 50 
presents a formula appropriate for use with Method 4 (or with Method 3 
when b=0). This approximate formula, derived from expected value theory, 
has two terms: one reflecting variance around the flow/concentration 
regression model, and one reflecting variance in the estimate of the 
slope parameter b. 

146. Figure 18 compares the observed mean squared estimation 
errors for Method 4 with error variances estimated from the formula in 
Table 50 using simulated data. Reasonable agreement between observed 
and estimated variances is apparent over the range of b values typically 
encountered. Similar tests have been done using real flow and concen¬ 
tration data. The formula in Table 50 overestimates the error mean 
square by an average of 18% for tributary stations and underestimates 

the error mean square by an average of less than 1% at discharge stations. 

Conclusions 


147. Preliminary data analysis has characterized the distributions 
of flow/concentration relationships in tributary and discharge streams. 
Methods 3 and 4 are generally better than Methods 1 or 2 for estimating 
loadings, given the distribution of concentration/flow sensitivies en¬ 
countered at various stations (see Figure 15). Method 3, which employs 
the flow-weighted average concentration, is actually a special case of 
Method 4, with b=0. In calculating loadings, it seems reasonable to use 
a regression analysis to estimate the slope parameter b and to use 
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other symbols defined in Tables 46 and 47 





Method 4 unless the slope estimate is not significantly different from 
zero, in which case Method 3 would be used. The formula in Table 50 
can be used to derive approximate estimates of error associated with a 
given mean loading calculated using Methods 3 or 4. Error variances 
can be used, in turn, to characterize the accuracies of reservoir nutrient 
loading and discharge estimates. 





PART XV: CONCLUSIONS 


148. The compilation, description, and preliminary analyses of 
the data base described in previous sections lead to the following con¬ 
clusions : 

a. Using primarily centralized sources of information, it 
has been possible to compile sufficient data on water 
quality, hydrology, sedimentation, and other project 
characteristics to permit testing of empirical eutrophi¬ 
cation models under Phase II of this study. 

b. The hydrology files need to be augmented with monthly 
elevation/contents data from several districts and 
projects. 

£. Information in the water quality files will not be ade¬ 
quate for assessing the trophic states of all 299 reser¬ 
voirs in the central project list. It should be adequate, 
however, for characterizing about 130 reservoirs and for 
testing empirical models relating within-pool measures 
of trophic state. 

d. Use of data sources outside of the EPA/National Eutro¬ 
phication Survey has generally more than doubled the 
total numbers of observations of various within-pool 
trophic state indicators. Non-NES data are generally 
more extensive temporally. 

£. Comparisons derived from the EPA National Eutrophication 
Survey compendium reveal significant differences between 
lakes and reservoirs in the means of most morphometric, 
hydrologic, and trophic state indicator variables. Com¬ 
pared with natural lakes, reservoirs have greater potential 
nutrient enrichment problems, as gauged by Vollenweider 
phosphorus loading models, but lower observed levels of 
chlorophyll-a, on the average. The validity of existing 
loading models in reservoirs is in question due to lake/ 
reservoir differences in morphometry, hydrodynamics, 
sedimentation, and region. Because of the relative geo¬ 
graphical distributions of lakes and reservoirs in the 
U. S., it is difficult to distinguish the effects of 
region from those of impoundment type. 

_f. For the purposes of this project reservoir hypsographic 
curves can be conveniently summarized using low-order 
polynomial equations relating the logarithm of total 
volume to the logarithm of total depth. Errors charac¬ 
teristic of this curve-fitting scheme are similar to 
those characteristic of direct interpolation methods. 


i 





Within-reservoir variations of trophic state indicators 
generally cannot be considered random with respect to 
station and date, the primary parameters used in monitor¬ 
ing program design. Spatial variations contain informa¬ 
tion on rates and directions of processes controlling 
eutrophication within reservoirs. Variance component 
analyses indicate that phosphorus and transparency have 
variance structures which are similar to each other, but 
fundamentally different from the variance structure of 
chlorophyll-a. Implications of these variance structures 
for monitoring and modelling efforts have been discussed. 

A method for estimating the mean and error variance of 
nutrient loadings derived from continuous-flow and grab- 
sample-concentration measurements has been developed and 
tested using real and simulated flow/concentration time 
series. The method employs a regression model relating 
concentration to flow and has been shown to compare 
favorably with alternative calculation methods with 
respect to bias and variance of loading estimates under 
a range of conditions. 











PART XVI: RECOMMENDATIONS 


149. The following additional data base development work is re¬ 
quired in order to permit testing of empirical eutrophication models 
and should therefore be included in the scope of Phase II of this study: 

a. corqailation of monthly elevation/contents data from many 
districts and projects, as indicated by the inventories 
in Appendix A. 

b. development of a scheme for sorting of water quality 
stations in downstream order within reservoirs. 

150. While compiled for the specific purposes of this project, the 
data base described in this report could be adapted for other, more 
generalized uses. As discussed in the introduction to this report, the 
current data base consists of a number of computer files, reports, maps, 
and data forms organized in a consistent framework. It is not a system 
designed for frequent interactive use. Given the current data base, the 
development of such a system would involve the following: 

a. definition of objectives (desired scope of data base uses). 

b. definition of operating environment (direct users, main- 
tainence personnel, and computer system). 

c. development of appropriate software for various purposes 
(e.g., accessing, updating, editing, summarizing, dis¬ 
playing, analyzing). 

d. compilation of any additional data required for intended 
uses of the data base (e.g., inclusion of priority pollut¬ 
ants, development of a digital reservoir mapping capa¬ 
bility) . 

e_. appropriate modifications in file designs. 

JF. establishment of channels and procedures for updating and 
verification of information. 

£. testing of the system in an operational environment. 

h. documentation of the system. 

i^. orientation of potential users. 

It is recommended that the Corps of Engineers consider expanding upon the 
existing data base, as outlined above, in order to permit more generalized 
use of the information which has been compiled under this project. The 
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APPENDIX A 

DATA INVENTORIES BY PROJECT AND DIVISION 
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